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ABSTRACT 


This  thesis  addresses  object  oriented  programming  languages:  and  a  restrictive 
definition  of  object  oriented  programming  languages  is  presented  and  defended. 
Differences  between  programming  languages  are  discussed  and  related  to  interactive 
integrated  programming  environments.  Topics  related  to  user  friendly  interface  to  the 
computer  system  and  modem  programming  practice  are  discussed.  The  thesis 
especially  addresses  features  in  object  oriented  programming  languages  that  are 
important  when  a  user  friendly  interactive  integrated  programming  environment  is 
designed.  Some  future  research  areas  are  suggested. 
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I.  INTRODUCTION 


A.  BACKGROUND 

Traditionally  programming  languages  have  evolved  towards  a  higher  level  of 
abstraction  offered  to  the  programmer.  Current  programming  languages  have  removed 
the  programmer  from  the  hardware  level  of  the  machine,  and  offered  him  her  increased 
semantic  power  of  the  language  which  better  captures  the  programmer's  concept,  but  it 
is  not  yet  normal  to  work  in  a  fully  interactive  integrated  programming  environment. 

The  problems  we  want  to  solve  with  computers  steadily  increase  in  size  and 
complexity.  We  often  talk  about  "the  software  crisis,"  and  this  can  be  viewed  as  a  sign 
that  we  are  reaching  the  limit  of  what  we  currently  are  able  to  handle.  The  complexity 
barrier  is  pushed  further  and  further,  but  we  still  need  to  create  the  software  we  need  in 
the  future  to  solve  these  difficult  problems.  Traditionally  the  tools  in  the  programming 
environment  have  been  made  by  programmers  for  the  benefit  of  programming.  The 
management's  need  for  tools  has  therefore  not  fully  been  recognized  by  developers  who 
began  the  implementation  of  interactive  integrated  programming  environments. 


B.  OBJECTIVES 

The  objective  of  the  thesis  is  to  show  some  of  the  aspects  that  are  relevant  in  the 
development  of  interactive  integrated  programming  environments,  especially  hew 
object  oriented  programming  languages  can  make  our  work  easier  and  more  efficient. 


C.  THE  RESEARCH  QUESTIONS 

Wrhat  is  an  object  oriented  programming  language,  and  what  is  different 
compared  to  other  programming  languages?  How  can  we  develop  a  more  user  triendh 
programming  environment?  W'hat  makes  it  user  friendly?  W'hat  kind  ol  programming 
environments  do  we  have  today?  Are  any  programming  languages  better  than  ethers 
to  build  user  friendly  interactive  integrated  programming  environments? 
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D.  SCOPE  AND  ASSUMPTIONS 


1.  Scope 

The  relevant  material  for  this  thesis  is  vast,  so  a  brutal  restriction  of  the 
subject  has  been  performed.  The  discussion  is  mostly  kept  on  a  single  user  system  level 
in  order  to  reduce  the  complexity.  Typical  representatives  from  each  major  type  of 
programming  languages  are  studied,  and  used  for  comparisons.  The  sample  programs 
in  the  appendices  are  all  written  in  programming  languages  available  on  persona! 
computers,  e.g.  Apple's  Macintosh. 

2.  Assumptions 

The  thesis  assumes  that  the  reader  has  some  basic  knowledge  of  computer 
science,  therefore  the  commonly  used  expressions  are  not  defined  here.  When  it  comes 
to  the  discussion  of  object  oriented  programming  languages,  and  inheritance,  no 
background  knowledge  is  assumed.  The  subjects  are  covered  more  in  depth  with 
explanations  of  new  concepts.  The  discussion  is  based  on  the  relevance  to  designers 
and  programmers,  not  so  much  on  management's  needs. 

The  future  seems  to  expose  a  growing  number  of  people  to  computer  systems, 
and  working  environments  where  computer  systems  are  an  integrated  part  of  the 
whole.  Therefore  the  discussion  concentrates  on  the  impact  on  the  average  user  who  is 
not  necessarily  a  computer  specialist. 


E.  LITERATURE  REVIEW  AND  METHODOLOGY 
1.  Literature  Review 

This  thesis  is  a  review,  combining  work  from  many  sources.  Most  of  the 
literature  comes  from  the  academic  environment,  and  each  source  normally  covers  only 
a  small  research  area.  Especially  when  it  comes  to  literature  about  object  oriented 
programming  languages  there  seems  not  to  be  a  clearly  defined  terminology. 


2.  Methodology 

The  methodology  used  is  based  on  an  extensive  study  of  literature  available  in 
different  areas:  programming  languages,  software  engineering,  cognitive  science, 
computer  science,  human  interfaces,  etc.  The  purpose  of  the  study  is  to  get  a  feeling 
for  what's  involved  in  interactive  integrated  programming  environments. 

Small  sample  programs  of  the  Tower-Of-Hanoi  problem  are  written  in 
different  programming  languages  (i.e.  Prolog,  Pascal,  Lisp,  and  Smalltalk)  in  order  to 
get  "hands  on"  experience,  and  to  better  understand  the  differences  between  the 
languages  and  environments. 


F.  SUMMARY  OF  FINDINGS 

The  thesis  concludes  that  the  structure  of  the  language  defines  the  boundaries  of 
the  thought  of  a  human  being,  and  that  this  is  valid  also  for  programming  languages 
and  programming  environments.  Object  oriented  programming  languages  have  four 
features:  information  hiding,  data  abstraction,  dynamic  binding,  and  hierarchy  of 
inheritance.  This  kind  of  object  oriented  programming  language  is  well  suited  for 
building  interactive  integrated  programming  environments  that  are  user  friendly. 
Today  interpreted  programming  languages  like  Lisp  and  Smalltalk  are  the  languages 
that  most  easily  facilitate  customizing  of  an  interactive  integrated  programming 
environment  to  the  user's  needs. 


G.  ORGANIZATION  OF  THE  STUDY 

The  first  chapters  discuss  various  programming  languages,  especially  the 
differences  between  them.  The  traditional  programming  languages  are  not  covered  in 
depth,  but  object  oriented  languages  (Smalltalk)  are  discussed  in  more  detail.  The 
thesis  establishes  what  minimum  criteria  a  programming  language  must  have  in  order 
to  be  a  real  object  oriented  programming  language.  Some  of  these  criteria  are  covered 
in  more  detail  in  order  to  give  a  better  understanding  of  what  object  oriented 
programming  languages  have  to  offer  the  designers  programmers  compared  to  other 
.anguages. 
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Next  the  comparison  is  brought  a  step  further,  and  looks  upon  the  interaction 
and  integration  in  some  sample  programming  environments.  Some  of  the  features  in 
the  environments  are  covered  in  more  depth  to  bring  forward  what  has  been,  and  stiil 
is.  important  for  the  evolution  of  the  interactive  integrated  programming  environments. 
The  thesis  ends  up  with  conclusions,  and  some  recommendations  for  future  interactive 
intearated  programming  environments. 


II.  INTRODUCTION  TO  PROGRAMMING  LANGUAGES 


A.  BACKGROUND  FOR  PROGRAMMING  LANGUAGES 
I.  What  is  a  Programming  Language? 

The  history  of  programming  languages  goes  back  to  1846  when  Lady  Lovelace 
programmed  Charles  Babbage's  machine  [Ref.  1].  In  doing  this  she  showed  that  she 
was  thinking  of  a  symbolic  system  as  a  language.  Many  would  say  that  it  was  her 
knowledge  of  mathematics,  and  not  her  knowledge  of  poetry,  that  led  her  to  this 
abstraction.  Mathematical  entities  are  abstractions  that  do  not  change  over  time,  and 
the  math  nautical  theories  are  well  accepted  and  understood. 

Computer  programming  is  a  complex  human  activity,  and  the  programming 
language  is  the  tool  used  to  get  the  hardware  to  do  what  the  programmer  wants  done. 
Programming  languages  have  changed  over  time.  In  the  beginning  it  was  machine 
language,  but  as  the  complexity  of  the  tasks  we  wanted  the  computer  to  solve  for  us 
grew,  we  got  symbolic  assemblers,  higher  level  languages,  and  symbol  manipulation 
languages.  We  have  pushed  the  complexity  barrier  further  and  further  as  human  beings 
have  tried  to  understand,  and  write,  programs  an  order  of  magnitude  larger  then  what 
has  been  feasible  previously. 

The  focal  point  in  rhe  computer  science  problem  solving  process  is  still  the 
programming  language  and  the  programming  environment.  Features  in  the 
programming  language  can  affect  the  way  a  programmer  approaches  the  design  of  a 
solution  to  a  particular  problem.  A  linguistic  theory,  the  Sapir-Whorf  hypothesis 
[Ref.  2],  states  that  the  structure  of  a  language  defines  the  boundaries  of  the  thought  of 
a  human  being.  There  is  a  strong  interaction  between  languages  and  thought.  The 
structure  a  language  presents  for  manipulating  words  and  the  vocabulary  available  for 
representing  ideas  constrain  the  thoughts  that  can  be  easily  and  accurately  represented. 
In  addition  the  structures  and  patterns  that  characterize  people's  thought  process  affect 
how  they  are  able  to  use  the  facilities  provided  by  a  programming  language.  In  other 
words,  a  limited  programming  language  will  be  a  handicap  for  the  programmer  who 
tries  to  realize  his  her  full  problem  solving  potential,  so  that  he  she  must  improvise  to 
get  an  acceptable  solution  to  the  problem  he  she  wants  to  solve. 


The  enormous  variations  in  symbols,  constructs,  and  syntax  observed  among 
natural  languages  is  also  true  of  computer  programming  languages.  Computer 
programming  language  differences  range  from  the  long  compound  words  of  Cobol  to 
the  symbolic  brevity  of  APL.  from  the  massive  size  of  Ada  to  the  compactness  of 
Pascal. 

2.  The  Purpose  of  a  Programming  Language 

Programming  languages  are  used  to  write  programs  in  order  to  get  some 
computer  hardware  to  perform  a  useful  function.  These  programs  have  a  dual  function, 
communication  between  human  and  machine,  but  also  human  to  human 
communication.  A  programming  language  must  therefore  provide  all  the  necessary 
interfaces  with  the  hardware  of  the  computer  system;  at  the  same  it  time  must  also  be 
able  to  capture  the  ideas  of  the  programmer.  High  levels  of  abstraction  increase  the 
semantic  power  of  the  programming  language,  and  capture  better  the  problem  solving 
concepts  of  the  programmer.  Programming  is  not  a  branch  of  mathematics:  it  is  a 
unique  form  of  communication  in  which  human  beings  take  an  active  role  and 
machines  often  a  passive  role. 

The  programming  language,  with  its  structure,  can  help  us  define  the 
boundaries  of  our  thoughts.  We  can  tailor  a  language  to  suit  our  special  needs,  such  as 
APL  for  mathematics  or  special  database  languages  (query  languages)  for  large 
collections  of  data,  in  order  to  reduce  the  distance  between  the  user  and  the  way  the 
user  thinks  about  the  problem. 

One  way  of  classifying  programming  languages  is  by  the  extent  to  which  they 
force  one  to  write  in  machine  level  procedures,  rather  than  in  natural  languages.  This 
scale  runs  from  machine  languages  and  assembly  languages,  through  high  level 
programming  languages,  througn  query'  languages,  to  natural  language.  As  one  moves 
up  the  scale,  the  structures  in  the  programming  language  take  over  more  and  more  of 
the  details  of  integrating  the  software  with  the  hardware.  Programming  languages  at 
the  upper  end  of  the  scale  make  computers  more  accessible  to  more  people,  mainly 
because  they  are  not  forced  to  understand  the  hardware  in  order  to  interact  with  the 
computer  system. 


3.  What  are  the  Criteria  for  "Good"  Programming  Languages? 

A  perfect  programming  language  must  be  ideally  suited  for  all  situations,  for 
all  users,  for  all  applications,  and  for  all  computer  systems.  Today  the  programming 
language,  in  addition,  must  be  unambiguous,  because  the  current  state  of  technology 
has  problems  letting  the  context  decide  the  accurate  meaning  of  the  statements. 

To  design  a  language  that  is  easy  to  understand  and  use,  and  at  the  same  time 
is  powerful,  we  must  accept  a  tradeoff  between  the  principles  such  as  the  following 
from  MacLennan's  list  [Ref.  3:  p.  526-527): 

1.  Abstraction:  Avoid  requiring  something  to  be  stated  more  than 
once;  factor  out  the  recurring  pattern. 

2.  Automation:  Automate  mechanical,  tedious,  or  error  prone 
activities. 

3.  Defense  in  depth:  Have  a  series  of  defenses  so  that  if  an  error  is 
not  caught  by  one,  it  will  probably  be  caught  by  another. 

4.  Information  hiding:  The  language  should  permit  modules  designed 
so  that  (1)  the  user  has  all  of  the  information  needed  to  use  the 
module  correctly,  and  nothing  more;  (2)  the  implementor  has  all  of 
the  information  needed  to  implement  the  module  correctly,  and 
nothing  more. 

5.  Labeling:  Avoid  arbitrary  sequences  more  than  a  few  items  long; 
do  not  require  the  user  to  know  the  absolute  position  of  an  item  in 
a  list.  Instead,  associate  a  meaningful  label  with  each  item  and 
allow  the  items  to  occur  in  any  order. 

6.  Localized  cost:  Users  should  pay  for  what  they  use;  avoid 
distributed  cost. 

7.  Manifest  interface:  All  interfaces  should  be  apparent  (manifest)  in 
the  syntax. 

8.  Orthogonality:  Independent  functions  should  be  controlled  by 
independent  mechanisms. 

9.  Portability;  Avoid  features  or  facilities  that  are  dependent  on  a 
particular  machine  or  a  small  class  of  machines. 

10.  Preservation  of  information:  The  language  should  allow  the 
representation  of  information  that  the  user  might  know  and  that  the 
compiler  or  interpreter  might  need. 

11.  Regularity:  Regular  rules,  without  exceptions,  are  easier  to  learn, 
use,  describe,  and  implement, 

12.  Security:  No  program  that  violates  the  definition  of  the  language, 
or  its  own  intended  structure,  should  escape  detection. 
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13.  Simplicity:  A  language  should  be  as  simple  as  possible.  There 
should  be  a  minimum  number  of  concepts  with  simple  rules  for  their 
combination. 

14.  Structure:  The  static  structure  of  the  program  should  correspond 
in  a  simple  way  with  the  dynamic  structure  of  the  corresponding 
computations. 

15.  Syntactic  consistency:  Similar  things  should  look  similar;  different 
things  should  look  different. 

16.  Zero-one-infinity:  The  only  reasonable  numbers  are  zero,  one,  and 
infinity. 

It  is  at  the  present  time  not  possible  to  design  a  programming  language  that  is  ideal  in 
all  situations,  for  all  users,  for  all  computer  systems,  for  all  applications,  even  if  we  do 
have  the  principles  as  in  the  above  list.  Today's  programming  languages  are 
specialized  for  specific  areas  of  use,  in  order  to  keep  the  size  and  complexity  within  the 
human  being's  limit.  Some  high  level  programming  languages,  like  PL  1  and  Ada,  are 
designed  to  cover  a  wide  area  of  applications,  but  the  cost  is  increased  size  and 
complexity. 


B.  INTERFACES  IN  A  PROGRAMMING  ENVIRONMENT 
l.  Interfaces  in  Programming 

The  environment  in  which  a  programmer  performs  his  her  task  includes:  the 
physical  environment,  the  presence  or  absence  of  other  people,  the  personalities  of  the 
other  members  of  the  group,  directives  from  management,  learned  programming 
methodologies,  reference  manuals  for  the  programming  language  and  the  computer 
system.  All  of  these  affect  the  programmer  in  his,  her  job.  The  term  "programming 
environment"  can  also  be  used  more  specifically,  namely  for  a  set  of  computerized  tools 
which  ease  the  communication  between  the  human  being  and  the  computer  system 
[Ref.  4:  p.  559].  There  are  several  interfaces  in  a  programming  environment  that  are 
important  for  determining  how  the  human  being  thinks  and  reacts  [Ref.  4:  p.  142]: 

1.  Between  the  user's  conceptualization  of  the  actual  world  he  she  wants  to 
represent  and  the  programming  language  in  which  the  user  must  describe  this 
world  so  that  the  computer  system  can  simulate  it. 

2.  Between  the  programming  language  and  the  visual  presentation  of  the  language 
to  the  user. 


3.  Between  the  visual  presentation  of  the  language  and  the  way  the  user  must 
physically  indicate  what  action  should  take  piace. 


Figure  2.1  Interfaces  in  a  Programming  Environment. 


These  three  interfaces  are  visualized  in  Figure  2.1  where:  A  is  the  real  polar 
bear  to  the  simulated  polar  bear,  B  is  the  user  interface  to  the  computer,  and  C  is  the 
simulation  described  in  the  computer. 

The  features  of  a  programming  language  are  the  working  tools  for  the 
programmer.  The  programmer's  work  in  finding  a  solution  »  affected  by  the  tools 
available  to  him.  her.  The  interfaces  are  part  of  this,  and  r  ead  the  programmer 
towards  certain  problem  solving  methodologies,  but  it  migh>  ‘^at  it  is  not  the  best 
methodology  for  this  specific  situation.  People  normally  spend  mu„  time  describing 
data  manipulation  than  they  do  describing  control  flow  |Ref.  5:  p.  184-215], 
Traditional  programming  languages  on  the  other  hand  provide  for  the  development  of 
large  control  structures  with  embedded  data  manipulation.  The  natural  human 
tendency  seems  to  be  to  start  with  data  manipulation  and  add  control  flow  as  an 
afterthought.  Miller  (Ref.  5]  concludes  that  natural  language  is  not  adequate  for 
procedural  specifications,  but  that  a  limited  and  structured  subset  of  natural  languages 
might  be  more  effective,  and  make  the  human-computer  interface  more  friendly. 
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2.  Dimensions  in  User  Interfaces 

The  programming  environment  must  be  accessible  to  the  human  being  in 
order  to  be  of  any  use.  The  interface  between  the  human  and  the  system  is  complex, 
and  consists  of  a  large  number  of  design  decisions.  The  most  important  interfaces  are 
listed  next,  with  some  relevant  questions  added  [Ref.  6:  p.  13J: 

1.  Presentation;  how  are  objects  displayed  and  selected?  Does  the  system  translate 
structure  into  text.  i.e.  pretty  printing?  Can  it  give  more  than  one  representation 
of  the  same  object,  i.e,  multiple  views? 

2.  Command  interface;  how  are  commands  invoked?  Are  the  menus  context 
sensitive  or  not?  Select  then  command,  or  command  then  select?  Binding 
commands  to  keys? 

3.  Extensibility;  how  can  a  user  tailor  the  system  to  his/her  needs?  Does  the 
system  support  extensions?  What  is  the  performance  of  extensions?  What  kind 
of  mechanisms  are  used  for  the  extensions? 

4.  Window  systems;  what  is  the  underlying  technology  for  implementing  the 
interface?  What  is  accepted  in  the  windows?  How  does  the  system  support  the 
windows?  Is  the  window  system  compatible  with  other  systems? 

The  list  is  only  meant  to  give  an  idea  of  the  complexity  involved  when 
designing  a  user  interface.  The  question  of  how  to  present  a  programming 
environment  to  the  user  is  not  yet  fully  solved.  Both  the  technology  and  the 
methodology  are  immature,  and  we  have  many  contenders  in  the  field.  A  very  good 
example,  of  new  ideas  in  user  interfaces,  is  the  Macintosh  personal  computer  produced 
by  Apple  Computer  Inc.  This  system  shows  how  it  is  possible  to  give  a  novice  user 
access  to  a  powerful  personal  computer  system.  The  Macintosh  is  in  many  ways  an 
interactive  environment  that  lets  the  user  stay  in  control  throughout  the  session.  Ail 
applications  written  for  the  Macintosh  system  are  supposed  to  follow  a  standard 
(defined  by  Apple)  user  interface  in  order  to  reduce  the  learning  time  for  the  user.  It  is 
of  course  possible  to  violate  these  standards  if  one  wants  to. 


C.  THE  SEARCH  FOR  A  BETTER  SOLUTION 

1 .  Procedural  versus  Nonprocedural  Programming  Languages 

In  the  early  days  of  computers,  a  few  decades  ago.  the  job  for  the  designers 
and  programmers  was  to  convert  manually  existing  systems  to  new  technologies  using 
computers.  Today  we  still  convert  and  refine  existing  systems,  but  increasingly  the  job 
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is  to  apply  technology  to  do  something  new,  things  we  never  dreamed  the  previous 
system  could  do.  When  we  are  not  replacing  an  existing  system,  design  no  longer 
follows  directly  from  analysis.  Analysis  of  what  exists  yields  insufficient  information  to 
design  what  will  come  to  exist,  so  analysis  and  design  become  inseparable.  The 
problems  we  seek  to  solve  using  computer  systems  become  more  and  more  complex. 
The  traditional,  obvious  applications  for  computer  systems,  have  already  been  done 
ar.d  newer  applications  are  often: 

1.  More  complex. 

2.  Less  obvious. 

3.  Larger. 

4.  Used  for  longer  period  of  time. 

5.  More  likely  to  change  over  time. 

This  problem  complexity,  and  the  necessity  to  deal  with  many  different  details  at  one 
time,  makes  programming  generally  very  difficult.  We  are  trying  to  master  this 
complexity  by  applying  what  has  worked  for  us  in  the  past.  The  programming 
language  designers  have  realized  this,  and  have  given  us  temporary-  relief,  but  every 
time  the  problems  we  want  to  solve  outgrow  the  current  programming  technique. 
Symbolic  assemblers,  higher  level  languages,  and  symbol  manipulation  languages  have 
in  turn  pushed  the  complexity  barrier  back  [Ref.  4],  Programming  language  design  is  a 
cumulative  learning  process,  and  programming  is  still  a  very  young  branch  of 
engineering.  The  evolution  of  programming  languages  has  resulted  in  solutions  to  a 
broader  class  of  problems,  and  even  new-  approaches  toward  the  solution  of  presently 
unsolved  problems. 

We  have  different  considerations  that  dictate  the  design  of  programnur.g 
languages  [Ref.  3:  p.  523]: 

1.  Uses  (problems  solved). 

2.  Users. 

3.  Computers  on  which  the  programming  language  can  be  implemented. 

4  Successes  and  failures  of  the  designs  of  the  past. 

T he<e  different  considerations  show  that  it  is  very  difficult  today  to  construct  a  single 
language  that  can  cover  all  possible  needs,  even  if  we  have  the  well  defined  design 
principles  stated  earlier  in  this  chapter. 

The  search  for  a  better  programming  language  has  given  us  a  wide  variety  of 
du'eren:  languages,  and  dialects  of  languages.  A  nonprocedural  programming 
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language  is  one  that  lets  the  programmer  concentrate  on  "what"  he  she  wants  the 
program  to  do,  instead  of  "how"  to  do  it.  Related  to  this  is  the  separation  of  the  logic 
component  and  the  control  component  within  the  language.  Examples  are  Prolog  and 
Smalltalk,  and  this  thesis  will  cover  them  in  more  detail  later.  Procedural  programming 
languages  are  the  more  conventional  languages,  in  which  the  programmer  has  full 
responsibility  for  the  control  component.  Examples  are  Ada  and  Lisp,  which  also  will 
be  covered  in  more  detail  later. 

The  terms  procedural  and  nonprocedural  will  be  discussed  in  more  detail  later. 
No  well  defined  and  agreed  upon  definition  exists,  but  examples  from  different  types  of 
programming  languages  will  be  used  to  clarify  the  differences. 

The  procedural  languages  are  often  divided  into  two  subclasses.  Imperative 
and  Applicative  languages.  Imperative  programming  languages  includes  most  of  the 
traditional  languages  (eg.  Fortran,  Cobol,  Pascal),  but  not  languages  like  Lisp,  Prolog, 
and  Smalltalk.  Prolog  and  Smalltalk  are  not  applicative  languages  either.  Imperative 
languages  depend  heavily  on  an  assignment  statement  and  a  changeable  memory  for 
accomplishing  a  programming  task.  Most  of  these  languages  are  basically  a  collection 
of  mechanisms  for  routing  control  from  one  assignment  to  another.  In  an  applicative 
language  on  the  other  hand  the  central  idea  is  function  application,  that  is  to  apply  a 
function  to  its  argument.  A  subset  of  Lisp  can  be  used  as  an  applicative  language. 

2.  What  do  W'e  Want  in  the  Future 

Historically  the  introduction  of  high  level  programming  languages  relieved  the 
designer  programmer  from  the  machine  code  by  introducing  higher  levels  of 
abstractions.  The  future  should  give  us  high  level  programming  environments  that 
provide  help  for  the  designer/programmer  in  understanding  and  manipulating  complex 
software  systems.  The  human  user  should  not  worn'  about  the  detailed  specification  of 
algorithms,  but  rather  work  with  the  description  of  the  properties  of  the  packages  and 
objects  we  use  to  build  programs.  The  programming  environment  should  give  us  a 
higher  level  of  abstraction  so  that  we  can  specify-  behavior,  i.e.  what  to  do  instead  of 


how  to  do  it. 


D.  what  kind  of  help  can  hardware  offer 


1.  Hardware  Cost  and  Performance,  and  Its  Implications 

The  economics  of  data  processing  are  changing  rapidly.  Historically,  the 
hardware  cost  of  a  computer  system  was  so  high  that  concern  with  hardware  efficiency 
was  not  only  justified,  but  essential.  Therefore  programmers  worried  about  the  CPL' 
time  and  memory  space  their  program  code  needed.  Today  the  declining  cost  of 
hardware  makes  developing  and  maintaining  of  many  programs  more  expensive  than 
running  them.  Therefore  emphasis  is  shifting  from  efficiency  on  the  computer  to 
controlling  software  cost  and  user  friendliness.  Software  cost  in  this  context  is  the 
total  accumulated  cost  over  the  whole  life  cycle  of  the  application,  i.e.  problem 
definition,  specification,  design,  coding  and  testing,  implementation,  maintenance,  and 
purging.  The  shift  in  cost  also  affect  the  design  of  programming  languages,  and  at  the 
moment  we  have  a  wide  variety  of  experimental  languages  taking  advantage  of  the 
increased  performance  of  computer  hardware. 

Alan  Kay  [Ref.  3:  p.  453]  in  the  late  1960s  was  convinced  that  in  the  future  it 
would  be  possible  to  put  the  power  of  what  was  then  a  room  sized,  million  dollar 
computer  into  a  small  machine  (personal  computer)  placed  on  a  person's  desk.  He 
asked  himself  what  kind  of  language  would  be  needed  for  this  machine,  and  decided 
that  a  simulation  and  graphics  oriented  programming  language  could  make  the 
computer  power  accessible  to  nonspecialists.  Xerox  Corporation  started  design  of 
Smalltalk  based  on  his  ideas  long  before  suitable  machines  were  around.  Xerox 
developed  Smalltalk  as  a  software  system,  rather  than  creating  a  specific  hardware 
package.  The  experience  gained  by  developing  applications  in  one  Smalltalk  system  was 
used  to  generate  next  generation  of  Smalltalk,  and  so  on.  The  current  system. 
Smalltalk-SO,  was  developed  to  be  adaptable  for  implementation  on  a  large  number, 
and  variety,  of  computer  systems.  This  is  one  of  the  few  cases  w'ere  the  language  is 
ready  before  the  hardware  to  run  it  on. 

Many  of  the  facets  of  programming  are  currently  caused  by  the  way  we 
'adjust'’  the  user  to  the  hardware  available.  The  nature  of  programming  is  going  to 
change  in  the  future,  as  the  computer  technology  matures.  Current  programming 
techniques  are  not  adequate  for  building  and  maintaining  systems  of  the  complexity 
called  for  by  the  tasks  we  attempt  to  solve.  In  the  future  we  need  to  shift  our  attention 
away  from  the  detailed  specification  of  algorithms,  towards  the  descriptions  cf  the 
properties  of  the  modules  and  objects  with  which  we  build  programs.  Already  today 
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the  memory  available  is  so  large  and  so  cheap  (relatively)  that  we  can  see  changes  in 
the  way  people  program.  Higher  efficiency  of  the  hardware  aiso  removes  much  of  the 
work  previously  spent  on  speeding  up  modules  using  assembly  language.  Today  higher 
levei  languages  can  be  used  without  much  worry-  about  efficiency  problems  because  the 
optimized  compilers  have  become  very  good. 

Today  we  are  not  very  good  at  reusing  old  designs  and  modules  (code).  The 
wheel  is  reinvented  many  times  over  because  our  current  programming  languages  and 
methodologies  do  not  enforce  reuse  as  a  resource  saving  method.  Some  of  the 
problems  are  caused  by  the  lack  of  knowledge  of  existing  modules  that  can  be  reused. 
How  do  we  build  libraries  etc.?  Reuse  of  code  is  a  complex  and  not  well  understood 
problem  today.  There  are  many  ways  to  attack  the  problem,  and  this  thesis  looks  at 
what  object  oriented  programming  languages  can  offer  to  reduce  the  problem  of 
reusability.  The  topic  of  reusability  will  not  be  covered  in  depth,  but  also  hardware 
development  makes  a  difference.  Because  of  a  better  performance  to  cost  ratio  of 
computer  hardware  it  is  today  feasible  to  build  libraries  of  general  routines  that  can  be 
used  in  more  applications  with  minor  changes.  General  routines  are  normally  slower 
than  optimized  specialized  routines,  but  because  a  general  routine  can  be  used  more 
often  it  is  also  possible  to  put  in  more  resources  to  optimize  it. 

2.  Visual  Interfaces 

The  human  interface  has  come  a  long  wav  since  the  introduction  of 
computers.  In  the  beginning  even  the  assembly  language  (mnemonics)  was  thought  of 
as  very  "human-'  and  "user  friendly,"  but  today  this  is  viewed  as  primitive.  Today  s 
bitmapped  high  resolution  screen,  with  a  pointing  device  (normally  a  mouse),  gives  the 
user  quite  a  different  interface  to  the  computer  .ystem.  The  growing  capabilities  and 
performance  of  hardware  are  used  to  ease  the  interaction  between  the  human  being 
and  his  her  hardware.  Alan  Kay's  dream  from  the  1960's  is  a  reality  today.  We  do 
have  personal  computers  with  the  power  of  yesterdays  mainframes,  and  new- 
programming  languages  like  Smalltalk  use  simulation  and  graphic  presentation  to  make 
the  system  accessible  to  the  nonspecialist  [Ref.  7],  Systems  like  Macintosh  and  Amiga 
are  examples  of  how  this  has  become  available  to  the  consumer  who  has  little  or  no 
background  in  use  of  of  computers. 
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3.  Firmware 


Traditionally  a  system  could  be  described  by  its  software  and  its  hardware. 
Today  this  is  not  quite  true  because  the  difference  between  software  and  hardware  has 
become  rather  fuzzy.  Everything  that  can  be  done  in  software  can  be  implemented  in 
hardware  and  vice  versa  (not  quite  true  because  we  need  some  hardware  components 
to  run  the  software  on).  The  introduction  of  a  user  friendly  interface  put  a  much 
heavier  load  on  the  computer  resources,  and  firmware  has  therefore  been  implemented 
to  speed  up  execution.  A  typical  example  is  the  Macintosh.  When  it  was  presented  in 
1934  it  hau  128k  in  RAM  and  64k  in  ROM.  The  ROM  is  an  example  of  firmware  that 
contains  very  efficient  routines  that  control  the  user  interface,  and  at  the  same  time  is 
accessible  to  application  programs. 


E.  SUMMARY  OF  THE  CHAPTER 

Computer  programming  is  a  complex  human  activity,  and  the  programming 
language  is  the  tool  used  to  get  the  hardware  to  do  what  the  programmer  wants  done. 
Programming  can  be  viewed  as  a  unique  form  of  communication  in  which  human 
beings  take  an  active  role  and  machines  often  a  passive  role.  The  structure  of  a 
programming  language  defines  the  boundaries  of  the  thought  of  human  beings,  i.e.  the 
programming  ianguage  limits  our  ability  to  solve  problems. 

A  perfect  programming  language  must  be  ideally  suited  for  all  situations,  for  all 
users,  for  all  applications,  and  to:  all  computer  systems.  At  the  present  time  no  such 
perfect  programming  language  exists,  but  we  have  more  specialized  languages  that  keep 
the  complexity  within  the  human  being's  limit. 


25 


III.  HUMAN  LIMITATIONS  AND  RELATED  TOPICS 


A.  BACKGROUND 


More  and  more  people  get  involved  in  tasks  where  a  computer  is  used.  It  is  no 
longer  only  a  small  group  of  specialists  that  perform  design  and  programming.  Already 
today  a  large  number  of  people  use  specialized  application  languages,  like  Lotus  1-2-3, 
in  their  daily  job  situation.  Most  people  do  not  think  of  using  a  word  processing 
package  as  programming,  but  in  reality  it  is  programming  at  a  very  high  level  of 
abstraction  for  a  very  specialized  context.  In  the  future  people  will  have  routine  daily 
interaction  with  computer  systems.  How  do  we  build  such  systems?  What  limitations 
does  the  human  being  have  when  it  comes  to  the  use  of  computer  systems?  This  is  the 
kind  of  questions  we  now,  and  in  the  future,  must  answer.  This  chapter  will  try  to  give 
an  overview  of  some  of  the  features  that  are  involved.  In  addition  some  ideas  from 
David  Lorge  Parnas  are  discussed  in  order  to  show  that  not  only  the  programming 

languages  are  important  for  which  problems  we  are  able  to  solve,  but  also  how  we 

actually  use  the  languages. 

"Cognitive  Science,"  "Cognitive  Psychology,"  and  "Human  Information 
Processing"  all  help  provide  the  conceptual  framework  needed  to  think  about  the 
abilities  and  limitations  of  the  person  designing  or  using  the  computer  system.  There 
are  three  basic  factors  involved  when  a  computer  system  is  designed: 

1.  Know  your  user:  experience,  limitations,  ability,  and  motivation. 

2.  Know  your  user's  task:  visual  and  manual,  what  must  be  done. 

3.  Know  your  user's  working  condition:  where  the  job  is  being  done,  what  it  is  like 

there. 

All  the  three  factors  important  to  the  system  design  involve  the  person  that  is  supposed 
to  use  the  production  system.  The  user  of  the  system  seems  to  be  the  limiting  factor 
for  many  implementations  of  computer  systems.  The  human  being  has  many 
disadvantages,  i.e  we  forget,  we  get  tired,  etc.  On  the  other  hand  we  also  have  many 
advantages  over  computer  systems  (at  least  today),  i.e.  we  are  good  at  recognizing 
patterns,  and  at  settina  a  situation  into  the  correct  context. 
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B.  HUMAN  LIMITATIONS 


1 .  Variations  in  Performance  between  Programmers 

No  commonly  accepted  theory  of  significant  factors  in  programming  or 
program  design  seems  to  exists,  but  there  have  been  some  studies  of  programmer 
performance.  Brooks  [Ref.  8:  p.  737-751]  found  enormous  variations  in  performance 
between  different  programmers  of  comparable  experience  for  the  same  programming 
task.  He  estimated  factors  between  5  and  100,  and  suggested  that  differences  in  the 
strategy  used  by  different  programmers  caused  these  large  variations. 

Perhaps  programming  activity  is  too  complex  a  human  behaviour  to  be 
studied  in  detail,  and  must  therefore  largely  remain  a  mysterious  process.  The  only 
way  to  achieve  the  necessary  knowledge  about  the  programming  activity  is  to 
systematically  study  programming  behavior.  Experiments  have  both  dependent  and 
independent  variables.  Dependent  variables  are  what  you  measure,  and  must  be 
selected  to  capture  the  part  of  the  programming  task  you  are  interested  in.  If  you  use 
several  dependent  variables  to  measure  more  aspects  of  the  performance,  the  sum  total 
of  the  information  (e.g.  time  used,  numbers  of  errors,  design  strategy,  rated  ease  cf  use) 
wiil  give  a  better  picture  of  what's  going  on.  An  independent  variable  can  be  the  menu 
length,  from  which  the  appropriate  choice  must  be  selected.  This  kind  of  variable  can 
be  used  to  measure  programmer  performance.  Human  knowledge  is  by  necessity 
incomplete.  We  cannot  know  in  advance  what  wre  might  be  able  to  knew  and  what 
might  be  essentially  unknowable  [Ref.  9].  We  must  try  to  find  things  out.  or  as 
Einstein  said:  "The  important  thing  is  not  to  stop  questioning." 

Programmer  limitations  affect  how  much  coding  is  lost  because  the 
programmer  did  not  have  full  mastery  of  his  her  computer,  programming  language 
land  environment)  or  himself  herself,  or  a  combination  of  these.  In  addition  the 
programmer  may  be  unaware  of  a  certain  algorithm,  or  unable  to  grasp  a  sufficiently 
large  portion  of  the  problem  at  one  time  to  get  the  overall  picture  of  the  problem 
he  she  wants  to  solve.  The  problem  we  are  facing  today  is  how  to  give  the 
programmer  full  mastery  of  his/her  job,  and  at  the  same  time  give  him  her  the  powerful 
tools  needed  to  solve  the  large  and  complex  problems  we  currently  want  to  find  a 
solution  tc. 
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2.  The  Cost  of  Large  Complex  Software  Systems 

We  have  long  advocated  economy  of  scale  for  manufacturing,  but  does  this 
applv  to  software  systems?  To  produce  a  large  integrated  software  system  takes  a  long 
time,  and  costs  large  amounts  of  resources.  If  we  are  totally  dependent  upon  bug  free 
software,  we  are  at  the  moment  in  deep  trouble.  The  testing  dilemma  was  stated  by 
Dijkstra  [Ref.  10:  p.  6]  as:  "program  testing  can  be  used  to  show  the  presence  of  bugs, 
but  never  to  show  their  absence."  Today  program  testing  is  possibly  the  only  way  to 
ensure  an  acceptable  quality  of  a  complex  program.  Although  a  large  integrated 
software  system  is  more  economical  when  it  is  in  active  use,  it  naturally  costs  more  if 
we  must  take  it  out  of  use  due  to  an  error.  Because  it  is  larger,  there  are  aiso  more 
things  that  can  go  wrong,  so  things  will  go  wrong  more  often.  Finally  because  of  its 
size  it  is  more  difficult  to  find  out  what  is  wrong,  and  it  will  therefore  take  a  longer 
time  before  it  is  running  again.  Human  limitations,  when  it  comes  to  complexity  of 
systems,  seem  to  reduce  the  advantage  of  large  integrated  systems.  Especially  if  the 
modules  depend  upon  each  other,  and  therefore  if  there  is  an  error  in  one.  none  can  be 
used.  Some  advocate  that  systems  like  the  "Strategic  Defence  Initiative"  (SDI)  are 
outside  our  capabilities  when  it  comes  to  software  design  because  of  its  size  and 
complexity.  David  Lorge  Parnas  strongly  opposes  the  SDI  due  to  human  limitations; 
he  said:  "the  state  of  the  art  in  software  is  significantly  behind  that  in  other  areas  of 
engineering."  [Ref.  IT.  p.  1327]  The  lack  of  testability  of  SDI  software  makes  life 
difficult;  he  stated:  "we  would  need  a  software  system  so  well-developed  that  we  could 
have  extremely  high  confidence  that  the  system  would  work  correctly  when  called 
upon."  [Ref.  11:  p.  1328] 

The  bottom  line  is  that  not  only  do  large  integrated  systems  today  cost  an 
enormous  amount,  but  we  also  have  the  "cost"  related  to  trust  in  the  system,  i.e.  do  we 
trust  our  systems  enough  to  let  them  control  our  life?  The  cost  related  to  trust  is  more 
related  to  politics  than  money,  therefore  it  is  less  measurable  and  much  more 
complicated  to  agree  upon. 

3.  User  Interface  Performance  Issues 

Vision  is  our  primary  sense,  but  it  has  its  limitations.  There  are  many  things 
the  eye  does  not  see,  so  the  screen  on  the  terminal  computer  can  take  advantage  of 
this.  The  human  visual  system  has  several  characteristics  that  control  the  use  of 
computer  systems  for  visual  representation  of  data  and  information.  The  human 
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perceives  as  simultaneous  changes  that  occur  in  less  than  20  ms.  And  if  successive 
frames  on  a.  screen  of  a  moving  image  are  redrawn  in  less  than  50  ms.  the  human 
perceives  the  object  as  moving  smoothly.  And  finally  if  feedback  to  user  initiated 
events  are  produced  in  less  than  300  ms.  the  human  perceives  it  as  occuring 
instantaneously.  These  indicated  speeds  represent  limits  for  which  objects  move 
smoothly  and  do  not  interrupt  the  user's  train  of  thought.  The  bottom  line  is  that  the 
human  is  very  slow  compared  to  computers,  so  the  drain  on  computer  resources  to  give 
a  good  visual  presentation  is  little  compared  to  the  advantages  it  can  give.  [Ref.  12:  p. 
P6] 

Colors  used  on  the  screen  also  do  have  an  impact  on  how  we  react  [Ref.  13:  p. 
3-33 j.  Because  vision  is  our  primary  sense  the  designers  of  interactive  programming 
environments  must  take  this  into  account.  Use  of  colors  in  the  interface  often  means 
that  the  resolution  is  poorer  than  with  a  monocrome  screen,  i.e.  there  is  no  such  thing 
as  a  "free  lunch." 

C.  COGNITIVE  SCIENCE 
1.  Human  Memory 

Andrew  Monk  [Ref.  13:  p.  49]  states:  "Human  memory  is  currently  believed  to 
be  a  complex  system  of  independent  storage  systems."  These  storage  systems  have 
different  characteristics.  Long  term  memory  seems  to  use  a  semantic  code  and  stores 
information  in  a  highly  organized  manner.  No  capacity  limit  seems  to  exist  for  this 
storage.  Retrieving  information  seems  to  be  a  process  of  reconstruction,  instead  of  just 
output  of  held  information.  That  means  that  the  output  may  be  different  from  what 
was  put  in.  The  short  term  memory  is  easier  to  measure,  test  and  is  therefore  better 
understood.  It  consists  of  "buffers"  that  holds  the  information  for  a  very  short  period 
of  time  while  it  is  processed  by  the  brain  and  stored  in  long  term  memory.  The  long 
term  memory  seems  to  be  limited  to  only  being  able  to  code  information  that  is 
meaningful  to  the  user.  Therefore  language  designers  and  program  designers  must 
ensure  that  the  user  has  a  mental  model  of  the  system  used.  The  system  environment 
shouid  work  the  way  human  beings  think.  Current  programming  languages  often  are 
designed  to  ease  the  parsing  problem  instead,  i.e.  the  use  of  prefix  constructs  instead  of 
infix  constructs. 
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2.  The  Learning  Process 

Learning  is  a  complex  process  of  integrating  new  knowledge  into  a  structure 
of  what  is  already  known.  Writing  a  program  is  a  process  of  learning,  both  for  the 
programmer  and  the  person  who  is  going  to  use  the  software  product.  This  learning 
takes  place  in  a  context  of  a  particular  hardware  system,  particular  programming 
language  or  programming  environment,  and  depends  also  on  the  society  around  us. 
Much  of  the  information  the  maintenance  programmer  has,  is  in  the  source  code  of  the 
program.  This  is  a  typical  example  of  the  dual  role  of  a  programming  language,  i.e. 
human  to  human,  and  human  to  computer  system,  communication. 

Reading  and  understanding  a  computer  program  listing  is  a  cognitive  task  that 
is  critical  to  the  development  and  maintenance  of  software.  In  the  interaction  with  the 
computer  system  the  user  often  receives  and  sends  information  in  the  medium  of 
written  language.  This  information,  the  words  and  sentences  used,  must  be  related  to 
each  other  for  the  reader  to  be  able  to  understand  the  information  as  a  whole.  The 
reader  is  performing  a  problem  solving  exercise  [Ref.  13:  p.  37).  How  the  human 
understands  this  piece  of  written  information  depends  upon  the  characteristics  of  the 
text,  but  also  it  depends  on  the  programmer's  past  experiments  and  familarity  with  the 
concepts  involved.  Especially  in  the  maintenance  phase,  because  normally  this  person 
is  not  the  person  who  wrote  the  program  in  the  first  place,  it  is  necessary  to  understand 
how  the  program  works  in  order  to  update  or  enhance  it. 

The  construction  of  a  program  can  be  viewed  as  building  a  mapping  from 
some  domain  external  to  the  program  into  the  set  of  objects  and  operations  available 
in  a  particular  programming  language  or  system.  The  task  of  understanding  the 
program  becomes  one  of  constructing  information  about  the  modeling  domains  used  to 
bridge  between  the  problem  and  the  executing  program.  Within  each  domain  there  is 
information  about  the  basic  sets  of  objects,  including  their  properties  and  relationships, 
the  set  of  operations  that  can  be  performed  on  the  objects,  and  the  order  in  which 
these  operations  take  place  [Ref.  14).  Figure  3.1  gives  a  visual  representation  of  a 
"banking"  problem  domain. 

Natural  languages  may  be  the  only  language  that  can  serve  across  multiple 
domains.  The  problem  is  that  natural  languages  are  inconuse  and  overload  main 
words  and  expressions.  Perhaps  we  ultimately  have  to  change  our  ways  of  thinking  a 
little  bit  in  order  to  tit  our  computers,  because  current  technology  does  not  allow 
natural  language  as  the  communication  between  the  human  and  the  machine. 
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'  PROBLEM 


Figure  3.1  Modeling  Domains  in  Programming. 


3.  Thinking  and  Reasoning 

Neil  Thomson  (Ref.  13:  p.  5]  describes  the  human  being:  "Of  all  the  human 
facilities,  our  ability  to  reason  is  the  one  which  appears  to  set  us  apart  from  the 
animals."  Thinking  seems  to  be  a  combination  of  inductive  and  dedueme  reasoning, 
but  at  the  same  time  the  two  are  difficult  to  separate.  The  process  of  arming  at  a 
logically  necessary  conclusion  from  initial  premises  is  deductive  reasoning,  and  it  is  a 
logical  process.  Inductive  reasoning  on  the  other  hand  is  not  a  logical  process,  but 
refers  to  the  production  of  a  general  statement  from  specific  instances.  The  problem  is, 
how  can  one  build  computer  s\  stems  with  a  reasoning  process  as  flexible  as  human 
reasoning.  So  far  we  have  not  been  able  to  do  this,  but  we  have  a  large  number  of 
expert  systems  that  are  used  as  working  tools  in  specialized  domains.  Typically  the 
expert  system  asks  questions  and  the  user  supplies  the  expert  system  with  tiie  facts 
needed.  One  problem  with  expert  systems  is  to  specify  and  quantify  the  knowledge  of 
an  expert  in  a  specialized  field;  another  problem  is  how  to  write  the  program  that  uses 
this  knowledge  in  an  intelligent  wav 


D.  WHAT  IS  "MODERN  PROGRAMMING  PRACTICE"? 


1.  Background 

The  'software  crisis"  has  not  been  solved  yet.  New  languages  have  moved  the 
complexity  barrier  further  and  further,  but  not  really  solved  the  problem  "how  to 
develop  large  and  complex  programs."  David  Parnas  [Ref.  10]  says  we  must  learn  to 
use  the  programming  languages  we  have.  The  problem  is  not  so  much  the  coding  itself, 
as  the  analysis  and  design  of  the  programs  in  order  to  make  the  coding  a  straight¬ 
forward  process.  In  other  words  we  must  understand  the  problem  and  our 
programming  language  in  order  to  be  able  to  solve  new,  large,  and  complex  problems. 

2.  Modern  Programming  Practice 

Modern  programming  practice  is  helpful  in  solving  complex  problems.  This  is 
a  programming  technique  where  more  work  is  put  into  the  analysis  and  design  phase  in 
order  to  save  later  cn  in  coding,  maintenance,  and  enhancements.  David  Parnas  is  a 
strong  supporter  of  this  methodology,  which  emphasizes  the  following: 

1.  Structured  programming. 

2.  Information  hiding. 

3.  Data  abstraction. 

-■  Top  down  design. 

The  basic  principle  of  structured  programming  is  like  the  military  strategy  of  divide  and 
conquer.  A  program,  or  software  module,  is  broken  into  small,  independent,  single 
function  modules  which  are  clear  and  easy  to  follow  logically.  These  modules  are 
themselves  composed  of  even  smaller  blocks:  the  decision,  sequence,  and  repetition 
structures  found  in  most  programs.  The  details  of  structured  programming  will  not  be 
covered  in  depth  because  they  are  of  little  relevance  to  this  thesis,  but  the  other 
elements  of  the  methodology  wall  be  covered  in  more  or  less  detail  in  the  context  of  the 
programming  languages  and  programming  environments  discussed. 

3.  VVhy  use  Modem  Programming  Practice? 

Many  of  the  problems  we  are  working  on  today  are  large  and  complex.  A 
large  number  of  people  and  resources  are  involved,  therefore  we  have  a  need  for 
management  of  the  project.  The  management  is  responsible  for:  allocation  of 
resources,  deadlines,  go  or  no  go  decisions,  funding  etc.  To  be  able  to  take  the 
decisions,  and  make  comparisons  between  different  projects,  it  needs  measurables.  In 
this  context  measurables  are  anything  that  can  be  measured.  Measurables  are 


quantified  data  that  is  used  for  comparisons,  evaluation,  etc.  In  a  production 
environment  the  management  is  normally  the  one  who  specifies  the  policy,  standards 
and  interactive  integrated  programming  environment  to  use.  Many  of  the  tools  in  the 
environment  will  therefore  be  used  for  managing  the  whole  lifecycle,  and  not  only  for 
the  design  code  phase.  There  may  be  a  conflict  of  interest  between  management  and 
other  users  of  the  system  because  of  the  measurables.  Often  technical  people  do  not 
want  to  give  measurables  because  they  are  afraid  of  later  being  confronted  with  them. 

It  is  better  to  be  clear  than  right,  because  if  you  are  clear  someone  might  tell 
you  that  you  are  wrong.  The  system  must  be  managable,  i.e.  measureable  for  the 
management.  In  addition,  maintenance  of  software  is  expensive  and  time  consuming. 
The  best  way  to  minimize  the  maintenance  cost  is  to  design  the  program  with  ease  of 
maintenance  in  mind.  One  way  to  do  this  is  to  use  modern  programming  practice,  and 
one  key  is  information  hiding.  If  this  is  done,  most  changes  wail  be  limited  to  the  logic 
contained  in  a  single  module,  and  the  possible  number  of  interactions  between  the 
modules  decreases  enormously.  Therefore  the  ripple  effect  will  be  minimized,  as 
changes  made  to  an  independent  module  should  have  little  or  no  impact  on  the  logic 
performed  by  other  modules.  Modem  programming  practice  often  results  m  levels  of 
abstractions,  levels  of  protection,  i.e.  programs  that  have  a  layered  structure. 


E.  WHAT  DOES  "FRIENDLY"  MEAN? 

1  L  ser  Friendly  Interfaces 

The  environment  in  which  programmers  work  is  a  rich  and  complex 
environment,  full  of  human  involvement  and  interaction,  change,  and  misleading 
appearances.  In  this  context  friendliness  would  be  the  distance  between  the  things  the 
user  thinks  about  doing,  and  the  things  the  user  actually  can  do  in  the  programming 
environment.  The  three  interfaces,  see  Figure  2.1,  can  give  a  measurement  of  how 
friendly  the  system  is  to  a  user. 

People  are  different,  and  react  differently  to  a  given  stimulus.  This  means  that 
what  is  friendly  to  one  person  may  not  be  equally  friendly  to  another  person.  Generally 
a  visual  representation,  along  with  reduced  interdependencies  among  different  parts  of 
the  system,  makes  a  programming  environment  more  friendly. 


Figure  3.2  L'ser  Friendly. 

User  friendliness  depends  on  how  easily  the  human  can  process  input  from  a 
computer  system,  and  how  easy  the  human  can  output  something  to  the  computer 
system  and  have  it  understood.  See  Figure  3.2  for  a  visual  representation  of  user 
friendliness.  Note  that  we  are  talking  about  the  input  to  the  human  being  and  not  (as 
it  normally  is  represented)  to  the  computer  system.  This  is  done  to  emphazise  the  shift 
m  design  that  are  taking  place  in  some  of  the  more  modern  implementations  of  user 
interfaces,  which  put  the  user  in  control  throughout  the  session.  The  interaction  with 
the  system  is  on  the  user's  premises.  The  trend  is  to  concentrate  more  and  more  on 
user  friendly  interfaces  between  the  user  and  the  computer  system.  On  many  systems 
tins  shows  up  as  windows,  menus,  and  use  of  a  mouse.  The  objective  is  tc  make  the 
process  of  communication  between  the  user  and  his  her  computer  system  more 
intuitive,  i.e.  more  natural  to  the  user. 

2  Interactive  Systems 

A  wide  wariety  of  hardware  can  be  used  in  an  interactive  environment  in  order 
to  help  communicate  information  from  the  user  to  the  computer  system:  kc".  beard, 
mouse,  pointing  device,  speech  etc.  At  the  same  time  there  are  many  ways  the 
^empurer  system  can  communicate  informations  to  the  user.  This  communication  of 


information  must  be  made  such  that  it  is  easy  for  the  user  to  find  the  information 
required,  as  well  as  to  make  it  easy  to  understand  once  found.  An  interactive 
integrated  programming  environment  should  offer  the  user  many  different  structural 
views  of  the  program.  Examples  are:  when  editing  the  programmer  may  want  to  view  a 
program  as  a  parse  tree,  when  reading  as  a  document  (text),  when  debugging  as  a 
control  flew  graph,  and  when  looking  at  the  screen  as  a  visual  image.  Computer 
resources  should  be  used  to  make  this  interaction  as  intuitive  as  possible  to  the  user, 
i.e.  the  user  should  not  have  to  fight  the  system. 

3.  How  do  We  Learn  to  use  an  Interactive  System 

Today  interactive  integrated  programming  environments,  mostly  based  on 
Lisp,  seem  to  be  characterized  by  tools  that  interact  with  each  other  frequently.  The 
programmer  may  easily  switch  between  an  editor,  interpreter,  debugger,  and  compiler. 
Each  of  these  tools  must  know,  to  some  degree,  about  other  tools.  The  user  interface 
must  be  as  natural  as  possible  for  the  human  programmer.  To  achieve  this  the 
environment  must  take  advantage  of  all  the  human's  strong  sides:  ability  to  recognize 
patterns,  visual  sense,  auditory  sense,  etc. 

We  have  a  great  deal  of  knowledge  about  the  human  visual  and  auditory- 
senses.  In  addition  we  have  more  or  less  accepted  theories  about  how  we  think  etc. 
But  at  the  moment  we  do  not  know  how  to  put  all  this  knowledge  together  in  order  to 
produce  the  optimum  interactive  integrated  computer  system.  The  human  being  is 
characterized  by  the  ability  to  change  context  without  loss  of  information.  We  are 
often  working  on  one  problem,  but  suddenly  we  get  an  idea  or  maybe  the  thought  just 
wanders  olf.  and  we  find  ourselves  in  another  context.  If  the  system  we  are  working 
with  could  support  also  this  human  activity  it  would  be  easier  to  learn  and  operate. 
Thus  means  that  the  system  has  to  be  modeless  to  the  user.  A  consistent  user  interface 
can  help  the  user  overcome  some  of  the  problems  he  she  meets  when  being  exposed  to 
a  new  interactive  system,  or  a  new  application  within  an  already  known  interactive 
'Vstem.  The  system  should  let  the  user  be  in  complete  control  rhroughout  the  session, 
md  :  Ter  the  user  the  structural  view  he  she  wants  to  use. 


F.  SUMMARY  OF  CHAPTER 


Today  more  and  more  people  are  exposed  to  computers  in  their  daily  life. 
Therfore  more  computer  resources  must  be  spent  on  the  interface  between  the  user  and 
the  computer  system  in  order  to  reduce  the  learning  problem  for  the  average  user. 

Our  ability  to  solve  large  and  complex  problems  is  not  only  limited  by  our 
programming  languages,  but  also  by  the  way  we  think  about  the  problem.  Modern 
programming  practice,  which  emphasizes:  structured  programming,  information  hiding, 
data  abstraction  and  top  down  design,  can  help  us  use  what  we  currently  have  to  solve 
problems  we  earlier  could  not  handle. 

Currently  we  do  not  know  how  to  put  all  available  information  together  in  order 
to  produce  the  optimum  interactive  integrated  computer  system.  The  user  interface 
must  be  as  natural  as  possible  for  the  human  user. 


IV.  DIFFERENCES  BETWEEN  LANGUAGES 


A.  PROCEDURAL  VERSUS  NONPROCEDURAL 

The  terminology  and  names  used  in  computer  science  are  not  always  clearly 
defined  and  nonambigous;  this  is  true  for  procedural  and  nonprocedural  programming 
languages  also.  In  this  thesis  the  nonprocedural  languages  are  the  ones  that  emphazise 
the  features  that  let  the  programmer  concentrate  on  "what"  he  she  wants  the  program 
to  do.  instead  of  spending  time  and  effort  to  tell  it  "how"  to  do  it  [Ref.  3:  p.  499).  The 
separation  of  the  logic  components  and  the  control  components  within  the  language  is 
also  important  to  nonprocedural  languages  [Ref.  15:  p.  424-436).  The  terms  procedural 
and  nonprocedural  are  relative  terms,  and  many  computer  scientists  would  say  "more 
procedural"  and  "less  procedural"  instead. 

In  logic  programming  languages  like  Prolog,  the  goal  directed  use  of  Horn 
clauses  lets  the  programmer  express  the  facts,  and  removes  the  "how"  part  of  the 
problem.  Prolog  does  have  control  features  built  in  that  the  programmer  can  use.  such 
as  the  "cut"  predicate.  Prolog  will  be  covered  in  more  detail  later. 

Object  oriented  programming  languages  are  in  this  thesis  classified  as 
nonprocedural  programming  languages.  This  claim  will  be  defended  later  in  the  thesis. 
The  discussion  is  more  on  programming  style  than  the  programming  languages 
themselves  because  it  is  possible  to  program  in  a  procedural  style  in  a  nonprocedural 
programming  language  (i.e.  Smalltalk).  In  a  conventional  procedural  language  the 
programs  are  active  and  the  data  elements  are  passive.  The  program  elements  act  on 
the  data  elements.  In  an  object  oriented  programming  language,  like  Smalltalk,  the 
data  elements  are  active  and  respond  to  messages  that  make  them  act  on  themselves. 
Maybe  they  modify  themselves,  or  maybe  modify  or  return  other  objects.  Object 
oriented  languages  will  be  covered  in  more  detail  later. 

The  control  structure  of  procedural  languages  determines  the  order  in  which 
actions  take  place  within  the  program.  The  statements  within  the  program  must  be 
executed  in  the  specified  order  to  ensure  correctness.  This  implies  a  very  close 
relationship  between  the  control  structures  and  the  actual  logic  of  the  program. 
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Theoretically  all  programming  languages  are  equally  powerful  because  they  all  build  on 
machine  code,  and  they  all  can  represent  a  Turing  machine.  For  human  beings  this 
seems  not  to  be  true,  but  this  is  only  because  the  programming  languages  do  not 
equally  well  support  the  way  people  think,  and  often  different  languages  are 
inappropriately  called  "less  powerful"  than  some  other  language. 


B.  PROCEDURAL  LANGUAGES 
1.  Historical  Background 

Traditional  programming  languages  do  not  really  offer  the  programmer  a  high 
level  of  abstraction,  but  rely  on  a  "word  at  a  time"  programming  style  [Ref.  4:  p.  404], 
The  von  Neumann  architecture  of  the  computer  system,  and  the  sequential  nature  of 
the  traditional  programming  languages  place  limitations  upon  the  level  of  abstraction 
the  language  designer  can  make  available  to  the  programmer.  Programming  has 
proved  to  be  much  easier  if  a  specialized  language,  for  building  similar  applications,  is 
available  to  the  programmer.  The  language  supplies  appropriate  abstractions  for  the 
application  domain,  and  a  programmer  can  simply  select  and  compose  these  high  level 
constructs  to  build  an  application.  The  primitive  constructs  in  the  language  are  then 
application  level  constructs.  These  specialized  languages  can  be  successful  in  a 
specialized  problem  domain,  but  are  generally  less  useful  for  other  applications,  and 
therefore  these  application  modules  are  difficult  to  reuse.  For  a  programming  language 
to  take  advantage  of  both  specialization  and  generality,  it  is  possible  to  build  the 
specialized  language  upon  a  more  general  base  language,  which  must  be  extensible. 

Special  languages  like  the  symbolic  language  APL  can  be  very  compact,  with 
very  high  semantic  power.  If  the  same  program  were  produced  with  Pascal,  a  much 
larger  program  (in  number  of  codelines)  would  be  the  result.  All  languages  have  the 
same  theoretical  power,  and  therefore  the  same  problem  can  be  solved  in  all  the 
programming  languages.  APL  is  less  procedural  than  Pascal,  less  has  to  be  said  about 
how "  and  this  makes  APL  more  compact. 

The  evolution  of  these  languages,  and  especially  the  effort  to  provide  more 
semantic  power,  has  resulted  in  the  development  of  Ada.  This  very  large  and  complex 
language  provides  increased  semantic  power  (higher  abstraction)  at  the  cost  of 
simplicity,  clarity  of  understanding,  and  maybe  the  programmer's  mastery  of  his  her 
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tools  (Ref.  16].  Lisp  is  a  language  with  a  long  history,  and  is  very  popular  in  research 
and  development  communities  because  everything  is  treated  as  lists.  Therefore  it  is 
easy  to  modify  and  extend  to  suit  special  requirements.  The  programmer  is  also  in  this 
case  responsible  for  the  control  structure  of  the  program. 

2.  How  do  we  Cope  with  the  Complexity  of  Programming? 

The  complexity  of  programming  increases  dramatically  with  the  size  of  the 
program,  because  the  possible  number  of  interactions  between  the  modules  that  make 
up  the  program  increases  so  enormously.  How  can  ure  deal  with  this  problem?  What 
has  worked  best  in  dealing  with  unmastered  complexity  in  the  past  is  a  combination  of 
the  following  [Ref.  17:  p.  8]: 

1.  Learning  from  analogous  situations  outside  the  present  situation. 

2.  Learning  how  people  think  and  combine  that  thinking  with  facts  and 
preconceptions  to  determine  actions. 

Traditionally  courses  in  computer  programming  teach  you  to  think  like  a  computer 
when  designing  a  program.  This  view  of  programming  does  not  support  information 
hiding,  which  is  one  of  the  critical  ideas  in  "modern  programming  practice."  In 
procedural  languages  the  programmer  is  responsible  for  the  control  structure  of  the 
program,  and  this  might  be  the  reason  for  "think  Like  a  computer"  programming.  Ada 
is  an  example  of  a  language  that  is  designed  to  incorporate  the  new'  programming 
ideas,  but  still  allows  the  programmer  think  like  a  computer  if  heshe  wants  to. 

Our  ability  to  grasp  a  complex  problem  is  controlled  by  short  and  leng  term 
memory.  Short  term  memory  is  a  workspace  of  limited  capacity  that  holds  and 
processes  those  items  of  information  occupying  one  s  attention.  The  capacity  of  short¬ 
term  memory  is  according  to  Miller  [Ref.  18]  seven  plus  or  minus  two  chunks.  A  chunk 
is  a  single  symbol  or  a  group  that  has  a  single  meaning,  i.e.  telephone  number,  piece  of 
code  etc.  As  programmers  mature  they  observe  more  algorithmic  patterns  and  build 
larger  chunks.  The  scope  of  the  concepts  that  the  programmers  have  been  able  to  build 
into  chunks  provides  one  indication  of  their  programming  ability,  and  it  is  rather  easy 
to  measure. 

3.  ADA  an  Example  of  an  Imperative  Language 

Imperative  languages  depend  heavily  on  an  assignment  statement  and  a 
changeable  memory  for  accomplishing  a  programming  task  [Ref.  3:  p.  344],  Most  of 
these  languages  are  basically  a  collection  of  mechanisms  for  routing  control  from  one 
assignment  statement  to  another  assignment  statement. 


The  Lmted  States  Department  of  Defence  (DoD)  initiated  the  development  of 
Ada  for  the  purpose  of  large  computer  programs  to  be  used  in  embedded  computer 
systems.  Ada  is  a  rich  and  complicated  language,  but  no  subset  is  allowed  by  DoD. 
Since  Ada  is  designed  as  an  integrated,  unified  language  it  is  difficult  to  remove 
features  without  disturbing  the  unity  of  the  remainder  of  the  language.  The  language 
supports  modern  programming  practice,"  and  is  a  block  structured  language  suitable 
for  general  purpose  programming.  Even  if  Ada  has  taken  many  ideas  from  Pascal,  it  is 
not  Pascal,  and  the  design  style  of  programs  will  be  very  different.  Pascal  is  a  language 
for  single,  independent  programs,  while  Ada  is  a  language  for  desigmng  and  creating 
large  software  systems.  It  is  designed  to  heip  create  programs  by  piecing  together 
standard  program  units  from  an  Ada  Library.  Information  hiding  is  supported  by 
modularization  (package),  and  this  helps  simplify  readability,  maintenance,  and 
enhancements.  Abstraction  both  for  data  and  control  is  achieved  with  the  features  in 
Ada. 

The  dependencies  among  program  units  are  explicit  in  Ada  source  code.  The 
structure  of  an  Ada  system  is  not  dispersed  among  the  source  code,  a  run  time 
executive,  and  auxiliary  system  generation  tools.  The  structure  is  completely  defined  by 
the  source  code.  The  Ada  compiler  ensures  that  units  are  compiled  in  a  correct  order, 
and  that  an  executable  system  is  configured  only  from  units  that  are  up  to  date. 

Some  have  expressed  concern  about  Ada.  that  it  may  be  too  large  for  the 
average  programmer  to  learn  completely,  and  that  the  programmer  therefore  will  lose 
mastery  of  his  her  working  tool.  Hoare  in  his  1980  Turing  Award  Lecture  was  critical 
of  the  size  and  complexity  of  Ada.  He  meant  that  the  language  contained  too  many 
features  that  made  it  complicated  to  use  effectively.  Much  of  the  complexity  comes 
from  the  wide  variety  of  features  to  handle  the  same  problem.  It  is  not  like  Lisp  where 
everything  is  a  list,  even  the  program  itself,  or  like  Smalltalk  where  everything  is  an 
object.  The  complexity  of  the  features  in  Ada  may  make  it  more  complicated  to  build 
user  friendly  interactive  integrated  programming  environments. 

4.  Pure  LISP  an  Example  of  an  Applicative  Language 

In  an  applicative  language  the  central  idea  is  function  application,  that  is. 
applying  a  function  to  its  argument  [Ref.  3:  p.  345].  Within  an  applicative 
programming  language,  functions  may  be  defined  explicitly,  conditionally,  recursively, 
or  as  the  composition  of  other  functions.  The  main  point  is  that  these  functions 


operate  only  on  data  (numbers,  characters  etc.).  Functions  that  take  other  functions  as 
arguments  provide  a  higher  level  of  abstraction. 

Lisp  is  one  of  the  oldest  languages  around,  second  only  to  Fortran.  It  was 
developed  in  the  late  1950s  out  of  a  need  for  artificial  intelligence  programming.  In  this 
kind  of  application  the  interrelationship  of  data  and  information  must  be  represented. 
The  resuit  is  that  the  pointer,  and  in  particular  linked  list  structures  are  natural  data 
structuring  methods  [Ref.  3:  p.  341J.  Standard  Lisp  is  not  a  pure  applicative  language 
since  it,  among  other  things,  has  an  assignment  operation:  "setq."  To  get  a  pure 
applicative  language,  a  subset  of  Lisp  can  be  used,  namely.  Lisp  without  'setq.'' 
"rplaca,"  and  rplacd,"  but  also  with  "eq"  restricted  to  atoms. 

An  applicative  subset  of  Lisp  encourages  higher  levels  of  abstractions  than 
conventional  Lisp,  because  the  assignments  are  hidden  from  the  user.  The  assignments 
may  well  exist  on  a  lower  level,  but  they  are  hidden  from  the  higher  levels  of 
abstractions.  An  applicative  programming  language  makes  it  easier  to  integrate 
different  tools  into  a  system  because  the  problem  of  side  effects  is  avoided. 
MacLennan  [Ref.  19],  among  others,  have  suggested  combination  of  features  from 
value  oriented  (.applicative)  programming  and  object  oriented  programming  in  order  to 
build  interactive  integrated  programming  environments. 

Lisp  is  a  symbol  manipulation  language,  where  the  fundamental  objects  are 
called  atoms.  Groups  of  atoms  form  lists.  An  atom  is  a  sequence  of  alphanumeric 
characters:  a  list  is  a  sequence  of  atoms  and  other  lists,  enclosed  in  parentheses.  Lists 
themselves  can  be  grouped  together  to  form  higher  levels  lists.  The  ability  to  form 
hierarchical  groups  (e.g.  lists  of  lists)  is  of  fundamental  importance  to  the  language 
[Ref.  20:  p.  2].  Atoms  and  lists  collectively  are  called  symbolic  expressions.  Symbol 
manipulation  is  often  called  iist  processing. 

Lisp  represents  both  the  data  and  programs  by  the  use  of  lists.  This  uniform 
way  ;o  represent  everything  in  a  program  makes  Lisp  ideally  suited  for  interactive 
integrated  programming  environments,  because  everything  can  be  treated  as  lists.  A 
symbolic  manipulation  program  uses  symbolic  expressions  to  remember  and  work  with 
data  and  procedures. 

Lisp  and  Smalltalk  both  perform  dynamic  type  checking  as  opposed  to  the 
static  type  checking  in  Pascal  and  Ada.  Both  Lisp  and  Smalltalk  will  be  discussed 
mere  in  detail  later,  especially  how  they  are  used  in  interactive  integrated  programming 
environments.  We  will  see  that  dynamic  binding  today  makes  it  easier  to  build  and 
maintain  interactive  integrated  programming  environments. 


41 


The  sample  program  Tower-Of-Hanoi  in  Appendix  C  is  written  :n  Lisp 
[Ref.  20:  p.  88-90], 


C.  NONPROCEDURAL  LANGUAGES 
1 .  Historical  Background 

A  pure  nonprocedural  programming  language  would  be  one  in  which  the 
programmer  only  had  to  state  what  was  to  be  accomplished,  and  leave  it  to  the 
computer  system  to  determine  how  it  was  to  be  accomplished.  Normally  this  kind  of 
programming  is  achieved  using  a  higher  level  of  abstraction  that  removes  the  user  from 
the  trivial  part  of  specifying  control  flow  etc.  Prolog,  which  stands  for  "programming 
in  logic."  was  developed  in  France  [Ref  3:  p.  500]  in  1972.  Since  then  a  number  of 
interpreters  and  compilers  have  been  developed  for  a  number  of  different  computer 
systems.  Prolog  is  becoming  more  and  more  popular  for  logic  programming,  and  some 
even  think  it  will  replace  Lisp  for  artificial  intelligence  programming  in  the  long  run.  It 
was  chosen  as  the  language  the  Japanese  will  use  on  their  5'th  generation 
supercomputers.  Although  there  are  many  logic  programming  languages,  this  thesis 
will  investigate  Prolog  since  it  is  a  typical,  and  well  known,  representative  of  logic 
programming  languages. 

Object  oriented  languages  like  Smalltalk  are  another  typical  but  less  known, 
example  of  nonprocedural  languages.  Many  computer  scientists  believe  that  object 
oriented  programming  is  the  only  possible  way  to  go  if  we  want  a  reliable 
implementation  of  a  very  large  and  complicated  software  system  (like  SDI).  This  is 
mainly  because  we,  at  the  moment,  are  not  able  to  perform  a  convincing  test  of 
conventional  written  programs.  Exhaustive  testing  is  not  possible  due  to  the  enormous 
number  of  possible  interactions  between  modules.  The  only  way  today  to  prove 
correctness,  i.e.  the  absence  of  bugs  in  the  program,  is  by  exhaustive  testing.  In  object 
oriented  programming  the  testing  problem  is  reduced  to  testing  the  objects.  Everything 
:s  treated  as  objects,  it  is  therefore  easy  to  add  to  or  change  a  program.  The  class 
structure  of  an  object  oriented  language,  like  Smalltalk,  prevents  objects  from  making 
too  many  assumptions  about  the  internal  behavior  of  other  objects. 


2.  Prolog  an  Example  of  Logic  Oriented  Language 

Prolog  builds  on  first  order  predicate  logic  [Ref.  21:  p.  14).  Computer 
languages  try  to  do  things,  but  logic  just  savs  that  certain  things  are  true  or  false. 
Prolog  looks  like  logic.  That  is,  its  syntax  is  that  of  logic,  but  the  semantics  are 
different.  In  logic  programming,  programs  are  expressed  in  the  form  of  propositions 
that  assert  the  existence  of  the  desired  result.  The  theorem  pro\er  then  must  construct 
the  desired  result  in  order  to  prove  its  existence.  Thus  a  side  effect  of  the  proof 
produces  the  wanted  result. 

Programming  in  Prolog  is  different  in  style  from  most  programming  in  other 
languages,  and  it  is  called  declarative  programming.  The  main  idea  is  to  write  programs 
as  lots  of  small  modular  pieces.  The  programmer's  emphasis  is  on  writing  correct 
pieces,  and  not  on  figuring  out  how  the  pieces  will  go  together.  The  emphasis  is  on 
whether  each  piece  makes  sense  by  itseif  and  whether  it  is  logically  correct,  not  on 
what  it  does.  Prolog  makes  the  "closed  world  assumption"  or  "lack  of  knowledge" 
inference  [Ref.  21:  p.  71).  That  is.  if  you  cannot  prove  something  is  true,  assume  it  is 
false. 

Prolog  is  not  a  pure  nonprocedural  language  because  it  has  built  in  some 
special  features.  The  "cut"  predicate  for  example  cannot  be  interpreted  declaratively. 
and  has  no  argument.  It  always  succeeds  when  you  first  encounter  it.  As  a  side  effect  it 
throws  away  backtracking  information  to  prevent  normal  Prolog  backtrackine.  A  cut 
predicate  at  the  end  of  a  rule  means  you  want  to  solve  the  problem  only  once. 

As  opposed  to  Prolog,  pure  logic  programming  allows  separation  of  logic  and 
control.  Because  the  clauses  of  a  logic  program  have  no  effect  upon  the  correctness  of 
the  program,  the  meaning  of  the  program  is  tied  to  the  logical  relationship  of  the 
program  clauses,  not  to  the  order  in  which  they  are  executed  [Ref  3).  The  programmer 
can  therefore  focus  on  the  details  of  the  logic  component  when  he  she  is  concerned 
with  program  correctness.  After  a  correct  program  has  been  established,  the 
programmer  can  concentrate  on  the  control  component  for  efficiency  considerations. 
Appendix  B  shows  a  small  sample  program  written  in  Prolog  where  also  the  cut 
predicate  is  used. 

A  frequent  danger  in  Prolog  programming  is  infinite  loops.  Typically  a  rule 
calls  itself  forever,  or  a  set  of  rules  call  one  another  in  a  cycle.  This  happens  more 
often  in  Prolog  than  in  other  languages  because  of  the  emphasis  on  recursion  and 
complicated  backtracking  (Ref  21:  p.  43).  Artificial  intelligence  programs  tend  to  be 


large,  complicated,  and  very  hard  to  debug.  A  problem  suitable  for  artificial  intelligence 
techniques  must  be  sufficiently  well  defined  to  have  dear  cnteria  for  success  and 
failure.  Otherwise  it  is  impossible  to  tell  if  it  is  working  or  not.  Prolog  generaiiv  has 
problems  when  processing  time,  storage  space  and  calculation  accuracy  are  critical.  In 
addit.on  most  people  find  it  difficult  to  read  and  understand  large  Prolog  applications. 

Pure  logic  programming  has  some  disadvantages  that  result  from  the  absence 
of  state  changes.  Therefore  it  is  difficult  to  write  programs  for  those  applications  for 
which  temporal  change  is  an  essential  element.  Examples  are;  databases,  graphics,  real 
time  programs,  interactive  systems  and  operating  systems.  Many  of  the  components  in 
an  interactive  programming  environments  are  of  the  above  type;  editors,  debuggers, 
version  control,  management  systems  etc.  But  Prolog,  as  opposed  to  pure  logic 
programming,  has  assert  '  and  "retract"  to  take  care  of  state  changes,  but  this  is  not 
enough  to  make  Prolog  well  suited  for  implementing  interactive  integrated 
programming  environments. 

3.  Object  Oriented  Languages 

What  is  meant  by  "object  oriented"  is  still  rather  vague,  and  a  lot  of  confusion 
seems  to  exist  even  today  among  computer  scientists.  Some  call  Lisp  an  object 
oriented  language,  others  claim  that  Ada  is  object  oriented  [Ref.  22:  p.  11-19],  There  is 
a  trend  to  give  existing  programming  languages  an  extension  that  includes  object 
oriented  features  but  the  question  is  if  this  makes  them  real  object  oriented  languages. 
An  object  oriented  programming  language  is  one  in  which  the  fundamental  processing 
paradigm  is  simulation.  In  such  a  language  one  often  sends  a  message  to  an  object, 
rather  than  the  more  traditional  approach  of  calling  an  active  procedure  to  operate  on 
some  passive  data  that  is  passed  to  it.  Object  oriented  programming  qualitatively 
enhances  the  design,  creation  and  maintainability  of  software  systems.  Much  of  this 
power  derives  from  modularity,  and  the  fact  that  absolutely  everything  can  be  handled 
as  objects.  The  message  is  not  a  distinguishing  factor  in  object  oriented  programming 
languages,  but  is  often  incorrectly  used  to  describe  them. 

The  object  oriented  process  is  closer  to  high  level  application  programming, 
and  further  from  the  machine  level,  than  the  traditional  approach.  The  object  oriented 
approach  allow  design  and  coding  to  be  done  at  the  same  time.  There  may  be  an 
object  in  the  library  that  nearly  fits  the  requirements,  and  you  can  start  the  coding  with 
this  object.  Instead  of  creating  new  modules  out  of  smaller  subparts,  you  create  new 
objects  by  modifying  existing  objects.  This  means  that  instead  of  piecing  together,  you 
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carve  and  shape  an  existing  object.  This  kind  cf  design  code  technique  seen:,  for  mans, 
to  be  more  natural  and  intuitive  than  the  more  traditional  design  code  process. 

Object  oriented  programming  languages  as  they  are  defined  m  this  thesis  are 
nonprocedural  because  the  programmers  do  less  of  the  how  than  in  more  tradit.or.al 
programming  languages.  The  classification  is  based  on  the  style  of  programming 
because  object  oriented  programming  languages  can  be  nussused'.  i.e.  used  in  a 
procedural  manner. 

Modularity  is  at  the  moment  probably  the  best  technique  available  for 
managing  the  complexity  found  in  software  systems.  This  means  that  certain  types  of 
complexity  are  strictly  contained  within  boundaries  called  module  interfaces.  Many 
programming  language  environments  include  features  that  support  modularity,  but 
only  above  the  level  of  the  procedure  call.  The  object  oriented  approach  incorporates 
modularity  at  the  most  basic  ievei  of  a  software  system. 

A  restaurant  can  be  used  as  an  example  to  demonstrate  the  difference  between 
the  object  oriented  and  the  traditional  approach.  In  an  object  oriented  restaurant,  you 
order  your  food  by  sending  the  chef  a  message  (via  the  waiter).  The  chef  is  assumed  to 
have  the  prerequisite  knowledge  to  take  the  order  from  there  and  prepare  the  food,  but 
you  may  also  give  the  chef  a  specific  recipe  if  you  want  to.  In  a  procedure  oriented 
restaurant,  you  must  send  the  recipe  to  the  chef.  This  means  that  you  must  know 
something  about  cooking  in  order  to  get  a  meal,  and  may  even  need  to  know 
something  about  the  chef.  The  procedure  specification  in  the  procedure  cal! 
establishment  puts  constraints  on  the  chef.  Suppose,  unknown  to  the  patron,  the  chef 
had  learned  a  better  way  to  prepare  the  meal.  Only  in  the  object  oriented  restaurant 
couid  the  new  skills  of  the  chef  really  be  exploited  to  the  benefit  of  everyone. 

What  are  the  minimum  requirements  to  be  an  object  oriented  language?  What 
features  must  be  supported?  This  thesis  will  try  to  show  that  the  language  must 
support  the  following  four  features  to  be  a  real  object  oriented  programming  language: 

1.  Information  hiding. 

2.  Data  abstraction. 

3.  Dynamic  binding. 

4.  Inheritance  hierarchy. 

Especially  the  last  feature,  inheritance  hierarchy,  reduces  the  number  of  languages  that 
can  be  called  object  oriented.  This  feature  also  suggest  that  this  thesis  support  the 
simulation  paradigm,  mere  than  just  message  passing,  as  the  most  important 
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description  of  object  oriented  languages.  This  definition  of  object  oriented 
programming  languages  excludes,  among  others,  the  following  languages:  Ada  supports 
packages  (untyped  data  abstraction)  but  not  inheritance,  CLL  supports  clusters  < typed 
data  abstraction)  but  not  inheritance,  standard  Lisp  does  not  support  information 
hiding,  and  ISO  Pascal  does  not  support  information  hiding. 

There  are  other  concepts  in  object  oriented  programming  that  are  not  centra! 
fo  the  basic  idea  of  object  oriented  programming.  One  is  automatic  storage 
management,  which  is  not  necessary,  but  is  very  useful  when  implemented.  Automatic 
storage  management  techniques  such  as  garbage  collection  and  reference  counting  let 
programmers  ignore  details  concerning  the  release  of  an  object  s  storage  m  memory  . 
Typically  garbage  collection  in  a  real  time  system  is  solved  by  one  of  the  following 
methods: 

1.  Sweep  garbage  collection. 

2.  Separate  memory  management  processor. 

3.  Parallel. 

4.  Spilt  virtual  memory  in  two;  cost  is  only  one  bit  in  address. 

These  techniques  make  the  application  source  code  cleaner,  and  the  overall  software 
system  more  reliable.  Another  concept  is  the  virtual  memory  system  needed  to  take 
advantage  of  ali  the  classes  (objects)  created.  Standard  Smalltalk  does  not  have  a 
virtual  memory  capability,  but  this  problem  does  not  exist  in  most  of  the  newer 
implementations  of  object  oriented  languages.  The  standard  Smalltalk-SO  s  object 
format  allow  a  simple  resident  implementation,  but  at  the  same  time  facilitate  easy 
extensions  to  virtual  memory.  In  the  future  it  may  be  feasible  to  swap  objects  instead 
cf  pages  in  such  a  system.  LOOM  [Ref.  23:  p.  25 1 -2T0J  is  an  example  cf  an 
experimental  single  user  virtual  memory  system  that  swaps  objects  and  operates 
without  assistance  from  the  programmer. 


D.  SUMMARY  OF  THE  CHAPTER 

The  programming  languages  can  in  general  be  classified  as  procedural  or 
nonprocedural.  In  procedural  programming  languages  the  programmer  is  responsible 
for  the  control  structure  of  the  program,  while  m  nonprocedural  programming 
languages  the  programmer  concentrate  cn  what  he  she  wants  the  program  to  do 


instead  of  how  to  do  it.  Ada  and  L,sP  are  examples  of  procedural  proarammm* 
languages,  while  Prolog  and  Smalltalk  are  examples  of  nonprocedural  programming 
languages.  Theoretically  all  programming  languages  are  equally  powerful,  and  can 
represent  the  Turing  machine. 


V.  WHAT  IS  AN  OBJECT  ORIENTED  LANGL AGE? 


A.  HOW  TO  DESCRIBE  AN  OBJECT  ORIENTED  LANGL  AGE? 

!  Genera!  Description 

Most  people  view  processing  in  an  object  oriented  system.  like  Smalltalk,  a1' 
emulation  'Ref's  ld.24j.  The  programming  language  objects  correspond  to  real  world 
objects,  and  the  manipulation  of  these  objects  are  simulated  by  sending  messages  to  the 
programming  language  objects.  Simulation  is  particularly  appropriate  to  systems  that 
must  deal  explicitly  with  the  passage  of  time  and  the  alterations  of  objects  through 
time.  Examples  of  these  systems  are:  interactive  systems,  graphics  systems,  operating 
systems,  editors,  file  systems,  version  control  systems  and  database  systems.  Smalltalk 
and  Simula  are  bu.it  upon  the  framework  of  a  conventional  procedural  language,  and 
this  framework  does  not  take  full  advantage  of  the  simulation  paradigm.  Seme  people 
feel  the  message  sending  is  the  key  to  describe  object  oriented  languages  [  Ref  1  2s j.  but 
this  thesis  shews  that  this  is  not  enough  because  the  four  features  described  m  the  la-t 
chapter  are  r.ot  guaranteed  by  message  sending  alone, 

Many  people  who  have  no  background  in  how  computers  work  find  me  idea 
of  object  oriented  systems  quite  natural.  This  is  probably  caused  by  the  vlose 
correspondence  between  thinking  about  computer  objects  and  the  real  world  objects 
Often  the  object  oriented  program  is  derivable  from  the  real  world  situation  it  is 
intended  to  model.  In  Smalltalk  they  concentrated  on  the  visual  impact  of  bitmapped 
high  resolution  graphics,  on  highly  interactive  user  interfaces,  and  on  increased  support 
for  the  user  in  the  design  and  programming  role.  The  enhancement  to  the  visual 
interface  covers  the  basic  concepts  of  windows,  menus,  and  scrollbars.  In  addmen  the 
interaction  between  the  user  and  the  system  emphasized  the  use  of  a  pointing  dc". ;.c 
■mouse)  rather  than  keyboard  for  selecting  objects,  and  operations  on  objects.  line 
basic  idea  about  how  to  create  a  software  system  in  an  object  oriented  fashion  comes 
mere  naturally  to  those  without  a  preconception  about  the  nature  of  software  sj.  stems 
i  Ref.  >:  p.  “4j. 
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Object  oriented  programming  is  a  technique  well  suited  for  organizing  ver- 
'.urge  and  complex  programs.  It  makes  it  practical  with  current  available  technology  to 
deal  witr.  problems  that  otherwise  would  be  impossibly  complex.  An  object  oriented 
program  consists  of  a  set  of  objects  and  a  set  of  operations  on  these  objects.  The 
dehmticns  of  the  operations  are  distributed  among  the  various  objects  that  they  can 
operate  on.  :e  the  w stem  :s  net  monolitic.  At  the  same  time,  the  definitions  of  the 


objects  are  di'trtbuted  among  the  various  facets  of  their  behavior  [Ref.  2':  p.  1|.  The 
data  values  inside  an  object  can  represent  the  properties  and  relations  in  which  that 
cb.ec:  participates,  and  the  behavior  of  the  programming  language  object  can  model 
the  behavior  of  the  corresponding  real  world  object.  The  main  paradigm  of  object 
^nei'.ted  programming  supported  m  this  thesis  is  simulation,  which  is  quite  natural 
when  you  know  that  many  of  the  ideas  in  object  oriented  languages  are  taken  from 
Simula  •  Simula  =  simulation*  jRef.  3:  p.  4b4].  The  creators  of  Smalltalk  at  Xerox  see 
the  language  characterized  by  the  following  three  principal  attributes  [Ref.  23:  p.  101: 

1  Data  stored  as  objects  which  are  automatically  deallocated. 

2  Processing  effected  by  sending  messages  to  objects. 

Behavior  of  objects  described  in  classes. 

An  object  oriented  language  is  organized  around  objects.  These  objects  are  places  for 
data  storage,  like  Pascal  records  In  addition  they  have  methods,  which  are  routines 
that  operate  on  the  object  s  data.  In  object  oriented  programming  you  decide  cn  your 
data  structure  first,  and  then  afterwards  you  decide  what  routines  you  need  to  operate 
h-.e  data  structures  You  can  do  this  in  ail  languages,  but  m  object  oriented 
laugu  iges  j.cu  can  group  the  data  structures  and  the  routines  together  into  objects. 
\:i  -bject  is  like  a  little  program,  that  dees  its  task  in  an  independent  manner.  Fiach 
program  knows  how  to  dc  its  task,  like  the  chef  in  the  example  described  earlier 
knows  hew  to  prepare  the  meal  you  are  ordering. 

2  Differences  between  Object  and  Procedure  Oriented  Programming 

The  restaurant  chef  example  mentioned  earlier  gives  a  good  feel  for  the 
deference,  but  the  following  examples  will  demonstrate  more  of  the  organizational 
deference  between  the  two  types  of  programming  we  are  discussing,  procedure  oriented 
and  object  oriented.  Most  existing  programs  are  procedure  oriented,  but 
nonprocedural  languages  like  Lotus  1-2-3.  l  ocus.  DBIII  etc.  are  now  becoming  more 
ind  more  important  in  our  normal  job  situation.  Many  cl  these  new  application 
specific,  r.  ..procedural  lammaces.  sacrifice  eiuceney  lor  a  more  user  inendlj.  interlace 


In  a  procedure  oriented  program  the  programs  are  organized  around  procedures  and 
functions,  and  the  programmer  decides  what  task,  needs  to  be  done. 


TABLE  1 

EXAMPLE  OF  PROCEDURAL  ORIENTED  PROGRAMMING 


-tu;f  member  =  RECORD 
rrcie-or  =  RECORD 
Undent  =  RECORD 

PROCEDURE  Salarvi personei) 

If-  personei  =  staff  member  THEN  calculate  salarv  this  wav 

E.LSEIF  personei  =  prctessor  THEN  calculate  salarv  another  wav 

LESLIE  personei  =  student  THEN  calculate  salarv  third  wav 

END 

PROCEDURE  Exercise! personei) 

IF  personei  =  staff  member  THEN  exercise  at  this  time 

ELSEIF  personei  =  prolessor  THEN  exercise  at  another  time 

ELSEIF  personei  =  student  THEN  exercise  vet  another  time 

END 


TABLE  2 

EXAMPLE  OF  OBJECT  ORIENTED  PROGRAMMING 


staff  member  =  OBJECT 

PROCEDL  RE  Salarv  calculated  this  wav 
PROCEDURE  Exercise  at  this  time 

END 

mofessor  =  OBJECT 

PROCEDL  RE  Salarv  calculated  another  wav 
PROCEDURE  Exercise  at  another  time 

END 

student  =  OBJECT 

PROCEDURE  Salarv  calculated  third  wav 
PROCEDURE  Exercise  at  vet  another  time 

END 


for  the  examples  assume  a  program  that  operates  a  school,  in  our  case  operates  on 
staff  members,  professors  and  students.  The  program  need  to  implement  payroll 
(salary)  and  physical  fitness  (exercise)  for  the  staff  member,  professor  and  student.  A 
very  high  level  of  pseudo  code  is  used  in  the  following  examples  just  to  highlight  the 
interesting  points.  The  examples  are  incomplete  programs  (i.e.  the  declaration  part) 
; ust  intended  as  an  illustration  of  the  organizational  difference  between  procedural  and 
object  oriented  programming.  Both  examples  are  supposed  to  solve  the  problem 
without  any  debate  of  which  is  the  best  methodology  to  use. 

The  difference  between  a  procedure  oriented  program  and  an  object  oriented 
program  is  a  matter  of  style.  Beth  approaches  can  do  exactly  the  same  things,  but  each 


approach  has  advantages  in  certain  programming  areas.  The  advantages  of  object 
oriented  programming  result  from  the  simulation  paradigm;  in  particular,  they  are  well 
suited  for  programs  that  deal  with  time,  and  changes  of  state  in  time. 

There  are  several  ways  to  look  at  the  difference  between  object  oriented 
programming  and  procedure  oriented  programming  [Ref.  2S:  p.  147]: 

1.  Code  viewpoint,  in  terms  of  the  program  structure  that  is  created  in  the 
program.  Each  object  that  is  created  can  be  viewed  as  an  independent  entity  in 
the  program.  Each  object  operates  on  the  data  passing  through  it  according  to 
its  own  buiit  in  rules.  Changing  the  object’s  methods  means  changes  to  the  built 
in  rules. 

2.  Data  viewpoint,  in  terms  of  the  data  structures  the  object  handles.  Each  object 
not  only  stores  information  (data),  but  also  processes  the  data.  i.e.  each  of  the 
objects  operates  on  the  information  within  itself. 

3.  Structural  viewpoint,  in  terms  of  the  resulting  way  to  design  code  the  program. 
Instead  of  piecing  together  smaller  modules  to  larger  ones,  you 
specialize  modify  existing  objects. 


B.  TERMINOLOGY  USED 
1.  General  Background 

The  term  "object  oriented  programming"  was  first  used  to  describe  Smalltalk 
programming  environments  developed  at  Xerox.  Smalltalk  took  many  of  its  most 
important  ideas,  such  as  classes  and  objects,  from  a  simulation  ianguage  called  Simula 
that  was  based  on  Algol-60,  and  designed  in  Norway  in  the  1960s.  The  two  languages 
are  different  in  a  number  of  ways.  Simula-67  contains  Algol-60  as  a  subset,  and 
supports:  block  structure,  static  (lexical)  name  binding,  and  compile  time  type 
checking.  Smalltalk  has  none  of  these  features;  it  is  more  in  the  style  of  Lisp  with 
uniform  representation,  dynamic  binding,  and  run  time  type  checking.  In  Smalltalk  the 
designers  combined  the  incremental  program  execution  of  Lisp  with  Simula's  class  and 
virtual  concepts.  Simula  was  designed  from  the  beginning  as  both  a  system 
description  language  and  a  simulation  programming  language. '  [Ref.  29:  p.  12Sj 

This  thesis  will  use  object  oriented  programming  concepts,  terminology  and 
characteristics  from  Xerox  Palo  Alto  Research  Center.  The  designers  of  Smalltalk 
decided  to  let  everything  in  the  system  be  an  object.  This  was  not  only  applied  to  the 
basic  datatypes,  but  also  extended  to  the  state  of  the  system:  activation  records. 
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instructions,  and  program  counters  all  followed  a  specified  format.  The  return  address 
for  even,’  subroutine  call  and  even  the  program  counter  is  an  integer  offset,  not  an 
absolute  address.  Everything  was  constrained  by  this  design,  even  the  most  frequently 
accessed  of  all  data,  i.e  the  instructions.  Smalltalk  is  defined  in  terms  of  an  interpreter 
for  a  virtual  machine  with  a  set  of  instructions.  Smalltalk-80  :  The  Language  and  Its 
Implementation,''  by  Adele  Goldberg  and  David  Robson  plays  an  important  roie  in 
this  picture,  and  is  a  de  facto  standard  when  it  comes  to  concepts  and  terminology  for 
eb;ect  oriented  programming.  The  description  of  Smalltalk  in  this  thesis  builds  on 
work  done  by  Goldberg  and  Robson.  See  appendix  A  for  terminology  for 
Smalltaik-SO. 

The  Smalltalk-SO  programming  system  is  divided  into  two  major  components; 
the  Virtual  Machine  and  the  Virtual  Image.  Protection  of  the  software  is  done  by 
copyrighting  the  Virtual  Image  [Ref.  23:  p.  4).  The  modular  design  of  Smalltalk  makes 
this  approach  for  protection  feasible.  The  virtual  machine  for  a  particular  computer 
s;.  stem  consists  of  an  interpreter,  a  storage  manager,  and  primitives  for  handling  the 
input  output  devices.  The  virtual  image  is  a  large  collection  of  objects  that  make  up 
descriptions  of  classes  providing  basic  data  structures,  basic  graphics  and  text,  viewing 
ar.d  user  interface  support,  compiler,  decompiler,  and  debugger.  Because  Smalltalk  r 
defined  in  terms  of  an  interpreter,  the  virtual  machine  is  easy  to  implement.  All 
systems  running  the  Smalltalk-SO  programming  system  would  therefore  lock  the  same 
to  the  user;  each  system  supports  bitmapped  graphics  and  a  pointing  device.  The 
research  effort  in  Smalltalk  environments  focuses  on  increasing  the  support  that  the 
computer  system  can  provide  to  users  without  a  background  in  computers.  The 
research  is  centered  on  the  visual  impact  of  bitmapped  graphics,  on  highly  interactive 
user  interfaces,  and  on  increased  flexibility  in  terms  of  user  programmability. 

These  design  decisions  show  that  the  human  interface  is  given  priority  over 
hardware  considerations.  Therefore  Smalltalk  executes  rather  slow  due  to,  among 
ethers,  the  following  reasons: 

!  Smalltalk  is  defined  in  terms  of  an  interpreter,  and  interpreters  are  slow. 

2.  Smalltalk  is  uniformly  object  oriented  and  this  implies  a  large  number  of 
messages  (procedure  calls),  which  are  time  consuming. 

3.  Smalltalk  creates  and  destroys  a  large  number  of  objects.  The  memory 
management  system  therefore  has  a  lot  to  do. 

There  are  five  major  concepts  in  Smalltalk:  objects,  messages,  classes, 
instances,  and  methods  [Ref  30:  p.  6-16].  The  Smalltalk  language  is  based  on  these 


five  consistent  abstractions.  System  components  are  represented  by  objects.  Objects 
are  instances  of  classes.  Objects  interact  by  sending  messages.  Messages  cause  methods 
to  be  executed.  Like  Lisp.  Smalltalk  seeks  to  provide  uniform  treatment  of  different 
kinds  of  information:  text,  graphics,  symbols,  and  numbers.  By  packaging  the  behavior 
of  each  form  of  the  -information  with  the  actual  data,  the  information  can  be  shared 
between  programs  without  changing  representation. 

2.  Objects 

Instead  of  two  types  of  entity  that  represent  information  and  its  manipulation 
independently,  as  in  procedure  oriented  languages,  an  object  oriented  language  like 
Smalltalk  has  a  single  type  of  entity,  the  object  that  represents  both.  In  a 
programming  system  that  is  uniformly  object  oriented,  like  Smalltalk,  a  class  is  an 
object  itself. 

An  object  is  an  instance  of  a  class,  and  represents  a  component  of  an  object 
oriented  system.  The  objects  represent  the  components  of  a  software  system.  Objects 
may  have  a  number  of  relationships  with  other  objects.  "One  object  may  be  part  of 
another  object,  or  (as  in  an  operating  system)  the  owner  of  another  object." 
[Ref.  31:  p.  6]  An  object  consists  of  some  private  memory  and  a  set  of  operations.  The 
nature  of  the  operation  or  an  object  depends  on  the  type  of  component  it  represents. 
That  :s.  objects  representing  numbers  compute  arithmetic  functions,  etc.  "At  any  point 
in  time  an  object  has  a  state,  which  is  the  sum  total  of  its  relationships  with  all  other 
objects  in  the  system."  In  systems  like  Smalltalk  the  instance  variables  determine  the 
state  of  an  object  and  the  methods  defined  in  the  object's  class  determine  the  object  s 
behavior  in  time."  [Ref.  31:  p.  6]  An  object  has  the  following  characteristics 
[Ref.  31:  p.  6"]: 

1.  Objects  are  temporal,  i.e.  they  exist  in  time." 

2.  "Objects  are  mutable,  and  have  a  state." 

3.  "Objects  can  be  created  and  destroyed." 

4.  "Objects  are  particular,  and  can  be  shared." 

The  first  job  for  the  designer  programmer  is  to  choose  what  he  she  wants  to 
be  the  objects  in  the  problem  he  she  is  trying  to  solve.  Objects  can  be  anything, 
examples  are:  numbers,  programs,  character  strings,  compilers,  computational 
processes,  text  editors. 


3.  Messages 

A  message  is  a  request  for  an  object  to  carry  out  one  of  the  operations  from 
its  own  set  of  operations."  [Ref.  30:  p.  6]  The  message  specifies  which  operation  it 
wants,  but  not  how  the  operation  should  be  carried  out.  The  object  to  which  the 
message  was  sent  determines  how  to  perform  the  requested  operation.  Messages  insure 
the  modularity  cf  the  system  by  specifying  the  type  of  operation  desired,  but  not  how 
to  perform  the  operation.  An  object's  interface  is  the  set  of  messages  to  which  the 
object  can  respond.  Interaction  with  an  object  goes  through  its  interface,  and  its 
private  memory  can  only  be  manipulated  by  its  own  operations.  Messages  are  the  only 
way  to  invoke  an  object's  operations.  These  properties  provide  security  since  the 
implementation  cf  an  object  cannot  depend  on  the  internal  details  of  other  objects, 
only  on  the  message  to  which  it  responds.  The  essential  point  is  that  the 
designer  programmer  decides  on  the  data  structure  first,  and  then  which  routines  are  to 
operate  on  these  data  structures.  An  object's  private  properties  are  a  set  of  instance 
variables  that  make  up  its  private  memory  and  a  set  of  methods  to  describe  how  to 
carry  out  its  operations. 

In  the  Smalitalk-SO  programming  system,  objects  and  messages  are  used  to 
implement  the  entire  programming  environment.  The  designer  programmer  determines 
which  kind  of  objects  should  be  described,  and  which  message  names  provide  a  useful 
vocabulary  of  interaction  among  these  objects.  This  is  an  acquired  design  skill,  and  it 
takes  time  to  master.  Messages  represent  the  interaction  between  the  components  of 
Smail talk-80:  the  arithmetic,  control  structure,  file  creation,  text  manipulation, 
compilation,  and  application  uses  [Ref.  30:  p.  40).  The  messages  make  an  object's 
functionality  available  to  other  objects,  while  keeping  the  object's  implementation 
hidden.  The  entire  programming  system  becomes  accessible  as  soon  as  the  objects  and 
the  messages  are  understood. 

4.  Classes 

In  Smalltalk,  you  describe  a  new  type  of  object  before  creating  it.  When  you 
are  done,  the  description  also  works  for  a  whole  class  of  objects.  Such  an  object 
description  is  called  a  class.  Any  object  created  from  the  description  is  called  an 
instance  of  the  class.  A  class  includes  a  method  for  each  type  of  operation  its  instances 
can  perform  'Ref.  30:  p.  9j. 


"The  class  describes  the  implementation  of  a  set  of  objects  that  all  represent 
the  same  kind  of  system  component."  [Ref.  30:  p.  S]  In  other  words  a  class  is  just  a 
name  for  a  particular  kind  of  object.  The  individual  class  describes  the  form  of  its 
instances,  private  memory,  and  how  they  earn.'  out  their  operations.  The  class  provides 
all  the  information  necessary  to  construct  and  use  objects  of  a  particular  kind, 
including  the  storage  for  methods.  [Ref.  3:  p.  40] 

5.  Instances 

An  instance  is  one  of  the  individual  objects  described  by  a  class.  Each  instance 
has  one  class,  but  one  class  may  have  multiple  instances.  Each  instance  has  storage 
allocated  to  maintain  its  own  state,  and  the  state  is  referenced  by  instance  variables. 
Each  object  has  its  own  set  of  instance  variables.  A  class  includes  a  method  for  each 
type  of  operation  its  instances  can  perform.  In  Smalltalk  the  attributes  of  an  object 
are  represented  by  instance  variables,  whose  values  are  themselves  objects.  All 
instances  of  a  class  represent  the  same  kind  of  system  component.  This  means 
[Ref.  30:  p.  56]: 

1.  All  instances  of  a  class  respond  to  the  same  set  of  methods. 

2.  All  instances  of  a  class  have  the  same  number  of  named  instance  variables  and 
use  the  same  names  to  refer  to  them. 

3.  An  object  can  have  indexed  instance  variables  only  if  all  instances  of  its  class 
can  have  indexed  instance  variables. 

6.  Methods 

A  method  in  a  class  tells  how  to  carry  out  the  operation  requested  by  a 
particular  type  of  message.  When  the  special  type  of  message  is  sent  to  any  instance  cf 
the  ciass.  the  method  is  executed.  The  class  includes  a  method  for  each  type  of 
operation  its  instances  can  perform.  The  object's  methods  can  access  the  object's  own 
instance  variable,  but  not  those  of  any  other  objects.  Methods  are  simply  procedures  or 
subroutines  that  are  invoked  by  sending  messages  to  a  class  instance. 

A  small  subset  of  the  methods  in  Smalltalk-80  are  not  expressed  in  the 
Smalltalk-80  programming  language  [Ref.  30:  p.  9],  These  are  the  primitive  methods; 
they  are  built  into  the  virtual  machine,  and  cannot  be  changed  by  the  application 
programmer.  The  primitives  are  invoked  with  messages  exactly  like  other  methods.  The 
purpose  of  the  primitive  methods  is  to  allow  access  to  the  underlying  hardware  and 
virtual  machine  structure. 


C.  INFORMATION  HIDING 


1.  Definition 

Information  hiding  is  to  hide  the  data  structure  used  in  one  module  from  the 
rest  of  the  program,  i.e.  prevent  access  to  the  data  structure  from  outside  of  the 
module.  Information  hiding  is  formalized  by  the  following  two  principles  [Ref.  3:  p. 
2921: 

1.  One  must  provide  the  intended  user  with  all  the  information  needed  to  use  the 
module  correctly  and  nothing  more." 

2.  One  must  provide  the  implementor  with  all  the  information  needed  to  complete 
the  module  and  nothing  more." 

Modularization  is  the  division  of  a  program  into  a  number  of  independent  modules. 
Each  module  is  like  a  small  program  that  can  be  implemented  independently  of  the 
other  modules.  The  result  of  each  design  decision  can  be  hidden  in  the  corresponding 
module,  if  this  decision  is  later  changed  only  that  module  has  to  be  modified.  This  is 
called  information  hiding  [Ref.  321- 

Modularization  can  also  be  viewed  as  taking  a  large  complex  program  and 
splitting  it  into  several  little  programs  using  the  following  principles  [Ref.  33]: 

1.  Cohesion;  each  module  should  perform  a  single  complete  logical  function. 

2.  Coupling;  each  module  should  have  access  to  only  those  data  elements  that  it 
needs  to  complete  the  assigned  task  (the  "need  to  know’  factor). 

This  is  much  like  the  military  methodology  of  "divide  and  conquer."  Programs  can 
then  be  designed  as  a  series  of  linked,  single  functional  logical  modules. 

2.  Information  Hiding  in  Object  Oriented  Languages 

In  simulation  it  is  often  necessary  to  implement  an  object  in  terms  of  lower 
lev  el  objects.  To  preserve  the  integrity  of  the  simulation  it  is  important  to  distinguish 
these  objects  and  relationships  that  are  part  of  the  simulation  from  the  objects  and 
relationships  that  are  not  part  of  it."  This  distinction  also  facilitates  "the  modular 
decomposition  of  the  system."  Smalltalk  and  Simula-67  both  allow  objects  to  be  used 
in  the  implementation  of  other  objects,  but  they  do  not  enforce  the  boundary  between 
these  levels  of  abstraction.  Enforcement  of  boundaries  have  been  implemented  m  more 
recent  object  oriented  languages,  including  recent  Simula  editions.  [Ref.  19:  p.  12] 

Information  hiding  ensures  reliability,  testability,  and  modifiability  of  a 
software  system  by  reducing  the  interdependencies  between  software  components.  The 
internal  data  structures  and  procedures  can  be  changed  without  affecting  the 


implementation  of  other  modules  when  the  internal  state  variable  of  a  module  is  net 
directly  accessed  from  the  outside.  Most  modern  programming  languages  support 
information  hiding  to  some  degree.  The  only  important  exception  known  to  me  is  ISO 
Pascal,  which  provides  no  way  to  declare  static  variables  within  the  sccpc  of  a 
procedure.  Standard  Lisp  does  not  support  information  hiding  either,  but  newer 
implementations  of  the  language  do;  they  use  something  hke  the  package  ’  in  Ada  to 
support  this  feature  in  Lisp.  Smalltalk's  programming  environment  uses  objects  and 
messages  to  facilitate  modular  design.  Also  other  languages  use  objects  and  messages 
lor  this  purpose;  Simula  uses  them  for  describing  simulations,  and  Hydra  uses  them  for 
describing  operating  system  facilities  in  a  distributed  system. 


D.  data  abstraction 

1.  Definition 

"Avoid  requiring  something  to  be  stated  more  than  once;  factor  out  the 
recurring  pattern."  [Ref.  3:  p.  12]  People  think  and  understand  by  means  of 
abstraction,  and  abstraction  is  the  major  technique  for  understanding,  and  inventing,  of 
complex  structures  in  the  real  world.  An  abstraction  provides  a  simple  view  cf  a 
structure,  and  summarizes  its  interesting  and  important  properties.  Hoare  [Ref.  10:  p. 
S']  stated  it  like  this:  "In  the  development  of  our  understanding  of  complex 
phenomena,  the  most  powerful  tool  available  to  human  intellect  is  abstraction. '  Each 
of  the  elements  of  an  abstraction  itself  can  be  an  abstraction  for  details  at  a  lower  tor 
hignen  level.  At  a  given  level  of  abstraction  the  view  must  be  simple  or  the  abstraction 
is  inappropriate.  The  major  idea  is  to  use  abstractions  to  make  it  easier  for  the  human 
being  to  understand  a  complex  problem.  Data  abstraction  is  the  ability  to  define  a 
type  by  specifying  the  operations  that  are  meaningful  for  it,  without  exposing  the 
representation  of  the  type. 

In  general  the  introduction  of  an  abstraction  layer  will  reduce  the  elTiciency 
cl'  the  algorithms.  Optimized  compilers  can  remove  this  problem,  but  in  dynamic 
binding  languages,  like  Lisp  and  Smalltalk,  the  added  functionality  means  a  significant 
loss  in  performance. 


2.  Data  Abstraction  in  Object  Oriented  Languages 

According  to  our  definition  of  object  oriented  programming  languages  Pascal 
and  Ada  are  not  object  oriented  because  they  do  not  support  all  four  features  in  the 
definition.  Data  abstraction  in  object  oriented  programming  languages  are  discussed  in 
relation  to  Pascal  and  Ada  in  this  section. 

Some  languages,  like  Pascal,  allow  the  programmers  to  define  new  data  types, 
but  the  representation  (e.g.  array,  record  etc.)  of  the  type  is  not  hidden  from  other 
parts  of  the  program.  When  the  representation  is  exposed,  the  abstract  properties 
cannot  be  assured.  Given  an  object  of  a  program  defined  type,  other  parts  of  the 
program  can  access  the  representation  directly  and  hence  can  violate  the  abstraction 
properties.  Object  oriented  programming  is  also  an  abstraction  mechanism.  In  some 
object  oriented  programming  languages,  like  Smalltalk,  objects  that  have  a  lot  in 
common  are  grouped  together  in  one  class.  This  is  an  abstraction,  but  also  it  shows 
how  data  abstraction  can  be  considered  a  way  of  using  information  hiding.  Some 
people  [Ref.  22:  p.  11-19]  view  the  important  features  of  object  oriented  languages  as 
information  hiding  and  data  abstraction.  This  definition  would  include  Ada  in  object 
oriented  languages.  Our  definition  of  object  oriented  programming  languages  excludes 
Ada,  but  because  it  is  one  of  the  newer  procedural  programming  languages  it  :s 
studied.  Ada  includes  abstractions  for  both  control  and  data.  The  control  abstraction 
includes  procedures  and  functions,  but  also  assignment  statements,  if  statements,  block 
statements,  etc.  [Ref.  34:  p.  9],  Ada  uses  several  features,  operator  overloading, 
generic  program  units,  and  packages,  to  implement  data  abstraction.  The  data 
abstraction  in  Ada  is  established  at  compile  time. 

In.  for  example,  Smalltalk  the  internal  structure  can  be  hidden  from  other 
objects.  This  data  abstraction  can  be  illustrated  with  the  following  example:  Suppose  a 
program  is  written  with  the  purpose  of  playing  chess,  or  as  in  this  case  restricted  to 
movement  of  the  chess  pieces.  A  Smalltalk  program  would  invoke  a  method 
"mcve_to."  passing  the  destination  square  as  a  parameter.  The  point  is  that  an 
assignment  statement  is  not  used  to  modify  the  data  structure  describing  the  chess 
nieces'  positions  on  the  board.  The  main  advantage  is  that  both  the  representation  of 
the  chess  pieces  and  the  implementation  of  "move_to"  can  be  changed  without  altering 
the  code  in  other  objects  that  access  them. 

Dynamic  binding  seems  to  be  the  only  reasonable  solution  to  the  data 
abstraction  problem  if  a  more  general  reuse  of  code  is  wanted.  It  is  possible  to  write 


general  procedures  that  use  variables  and  procedures  supplied  by  the  caller  s 
environment. 


E.  DYNAMIC  BINDING 

1.  Definition 

With  dynamic  binding  the  meaning  of  statements  and  expressions  are 
determined  by  the  dynamic  structure  of  the  computation  evolving  in  time,  that  is  at  run 
time.  U  ith  static  binding  the  meaning  of  statements  and  expressions  are  determined  bv 
the  static  structure  of  the  program.’’  [Ref.  3:  p.  119]  In  static  scoping  a  procedure  is 
called  in  the  environment  of  its  definition,  but  in  dynamic  scoping  a  procedure  is  called 
;n  the  environment  of  its  caller.  Scoping  rules  apply  uniformly  to  all  names,  not  onlv 
for  variable  names.  One  of  the  advantages  of  dynamic  scoping  is  that  it  is  possible  to 
write  a  general  procedure  that  makes  use  of  variables  and  procedures  supplied  by  the 
caller's  environment. 

2.  Dynamic  Binding  in  Object  Oriented  Languages 

The  object  oriented  programming  style  pushes  the  responsibility  for 
intepretation  of  the  message  onto  the  objects  themselves.  Smalltalk-SO  has  dynamic 
binding  of  methods  to  a  message  based  on  the  class  of  its  receiver.  This  dynamic 
binding  requires  a  lookup  of  the  selector  in  the  message  dictionaries  of  the  superclass 
chain  for  the  receiver.  Each  object  is  sent  exactly  the  same  message  selector,  but  the 
object  itself  determines  how  to  perform  the  requested  operation.  This  ensures  that 
implementation  of  an  object  cannot  depend  on  the  internal  details  of  other  objects, 
omy  on  the  message,  to  which  they  responds.  The  same  message  can  elicit  a  different 
response  depending  on  the  receiving  object.  Smalltalk-80,  for  example,  is  a 
dynamically  typed  programming  language,  and  is  therefore  generally  harder  to  compile 
than  interpret.  Operator  overloading  in  Ada  does  not  have  this  form  of  dynamic 
polymorphism  since  the  address  of  the  procedure  invoked  is  fixed  at  compile  time 
[Ref.  35:  p.  142], 


F.  INHERITANCE 


1  Definition 

Inheritance  is  the  sum  total  of  "genetic  characteristics  derived  or  acquired 
from  ancestors.'"  [Ref.  36]  Inheritance  is  especially  concerned  vith  the  management  of 
change,  and  it  is  the  key  to  simulation,  and  reuse  of  code. 

2.  Inheritance  in  Object  Oriented  Languages 

Classes  are  related  to  one  another  by  an  inheritance  relationship,  and 
inheritance  is  fundamental  to  the  object  oriented  paradigm.  In  Smalltalk,  for  instance, 
inheritance  is  interpreted  as  follows  [Ref.  3~:  p.  89]: 

1  If  class  B  inherits  from  class  A,  then  objects  of  class  B  supports  all  operations 
supported  by  objects  of  class  A. 

2.  If  class  B  inherits  from  class  A,  then  class  B's  instance  variables  are  a  superset 
of  class  A  s  instance  variables. 

3.  If  class  B  inherits  from  class  A.  then  the  code  of  any  methods  not  explicitly 
written  for  class  B  will  be  obtained  from  class  A. 

Inheritance  can  be  used  to  define  a  class  in  terms  of  one  or  more  other  classes.  If  a 
class  B  inherits  directly  from  a  class  A.  we  say  that  A  is  the  parent  of  B  and  that  B  is 
the  child  of  A.  The  terms  ancestor  and  descendant  are  used  as  normal,  and  follow  the 
inheritance  chain. 

See  Figure  5.1  for  a  visual  representation  of  inheritance.  In  this  case  class  B 
inherits  from  class  A,  and  B  is  the  child  of  parent  A.  Inheritance  enables  programmers 
to  create  new  classes  of  objects  by  specifying  the  difference  between  a  new  class  and  an 
existing  class.  Object  types  can  have  ancestor  object  types  from  which  they  inherit 
characteristics,  and  the  descendant  type  can  change  characteristics  inherited  from  its 
ancestor.  Seme  object  oriented  systems  allow  inheritance  between  all  the  objects  :n  the 
system,  but  normally  only  inheritance  between  classes  are  allowed.  A  class  may  be 
modified  to  create  another  class.  The  class  that  creates  the  other  class  is  called  the 
superclass,  and  the  other  class  is  called  the  subclass.  The  subclass  inherits  everything 
about  its  superclass.  The  terms  subclass  and  superclass  are  often  used  ambigcusly  to 
mean  both  direct  and  indirect  inheritance,  in  CommonObjects  [Ref.  38],  for  instance, 
the  distinction  between  direct  and  indirect  inheritance  is  particularly  important  because 
inheritance  is  a  mechanism  for  defining  objects  whose  interfaces  include  all  the 
operations  defined  for  another  class  without  saving  anything  about  the  internal 
representation.  In  Smalltalk,  for  instance,  inheritance  is  primarily  a  mechanism  for 


Figure  5.1  Inheritance  Example. 

building  more  complex  code  bodies  out  of  simpler  ones,  and  more  complex  data 
structures  out  of  simpler  ones.  A  large  amount  of  code  can  therefore  in  Smalltalk  be 
reused  because  it  is  not  necessary  to  start  from  scratch  each  time. 

G.  SOME  ADVANTAGES  AND  DISADVANTAGES  IN  OBJECT  ORIENTED 
PROGRAMMING 

1.  Advantages 

In  general  the  object  oriented  programming  process  is  closer  to  the  high  level 
application  programming  than  most  traditional  techniques.  The  main  point  seems  to 
be  that  it  is  easier  to  start  with  an  idea,  and  from  the  idea  design  code  more  easily 
follows  because  we  do  not  need  a  complete  specification  of  the  problem  to  get  started. 
Existing  objects  from  other  programs  or  libraries  can  be  used  to  form  new  programs 
instead  of  always  starting  from  scratch.  This  generally  saves  time.  Normally  the 
objects  we  start  with  are  bug  free,  the  specialization, changes  we  add  to  them  will 
therefore  be  easier  to  perform  and  the  time  spent  on  debugging  decreases.  Information 
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hiding  is  supported  so  each  of  the  objects  can  be  viewed  as  a  closed  universe,  therefore 
the  design  code  is  more  orderly.  And  finally  the  programs  are  easier  to  maintain. 

In  general  we  do  not  have  to  worry  so  much  about  the  details  of  the 
algorithms,  but  rather  work  with  the  description  of  the  properties  of  the  objects  we  use 
to  build  the  program.  We  are  at  a  higher  level  of  abstraction  and  specifications  of 
behavior.  ;.e.  we  worry  about  what  to  do  instead  of  how  to  do  it.  The  hardware 
resources  are  taking  over  much  of  the  work  previously  done  by  the 
designer  programmer. 

2.  Disadvantages 

MacLennan  [Ref.  19:  p.  3]  states  the  following  general  disadvantages  of  object 
oriented  programming: 

1.  "It  is  difficult  to  reason  about  things  that  change  in  time." 

2.  Object  oriented  languages  provide  little  ability  for  algebraic  manipulation. 

3.  The  analysis  of  object  oriented  programs  can  be  hard." 

In  addition  it  is  hard  to  master  object  oriented  languages,  like  Smalltalk,  because  there 
is  such  an  enormous  number  of  objects  available  in  the  system.  It  is  generally  easy  to 
get  started  using  a  subset  of  the  system,  but  to  use  all  the  built  in  capabilities  in  the 
library  takes  a  long  time  to  learn  and  master.  The  naming  convention  in  Smalltalk  is 
er.erally  better  than  in  Lisp.  i.e.  the  names  used  on  objects  gives  a  good  indication  of 
the  purpose  of  the  objects.  Lisp  also  has  a  large  number  of  functions,  often  with 
names  that  do  not  give  a  good  indication  of  the  purpose  of  the  functions. 

The  Smalltalk-SO  system  is  not  designed  to  run  background  processes,  or  to  be 
run  on  a  time  shared  computer  system.  Much  of  the  reason  for  this  is  all  the  loops  in 
the  source  cede  waiting  for  a  pointing  device  (mouse)  input,  i.e.  the  cost  of  a  user 
frier.dlv  window  interface. 


H.  SUMMARY  OF  THE  CHAPTER 

The  main  paradigm  of  object  oriented  programming  is  simulation.  Object 
oriented  languages  are  organized  around  objects,  and  group  data  structures  and 
routines  together  into  objects.  Smalltalk-SO  is  a  typical  example  of  object  oriented 
programming  languages  that  supports  the  four  criteria  in  this  thesis:  information 
hiding,  data  abstraction,  dynamic  binding,  and  inheritance  hierarchy. 


There  are  five  major  concepts  in  Smalltalk.:  object,  message,  class,  instance  and 
method.  System  components  are  represented  by  objects,  objects  are  instances  of 
classes,  objects  interact  by  sending  messages,  and  messages  causes  methods  :o  be 
executed. 

I  he  mam  adwntages  are  that  ue  can  program  at  a  higher  level  of  abstraction 
and  specification  of  behavior,  in  addition  the  dynamic  binding  makes  it  possible  to 
■Ante  general  procedures  that  use  variables  and  procedures  supplied  by  the  caller's 
environment.  The  disadvantages  are  long  learning  time  due  to  large  numbers  of 
objects  and  classes.  The  dynamic  binding  makes  the  system  slow. 


VI.  INHERITANCE 


i 


A.  BACKGROUND 

"In  a  simulation  of  any  complexity  it  is  infeasible  to  describe  the  behavior  of 
every  individual  object,"  because  the  simulated  "world"  consists  of  such  an  enormous 
number  of  objects  [Ref.  31V  Due  to  this  infinite  mass  of  information  it  is  impossible  to 
represent  all  the  details  about  the  whole  universe  inside  a  computer  system.  It  is  easier 
to  group  objects  into  classes  of  similarilv  behaving  individuals  (abstraction),  so  that 
their  common  behavior  can  be  described  just  once.  "Abstraction  is  the  decision  to 
concentrate  on  properties  which  are  shared  by  many  objects  or  situations  in  the  real 
world,  and  to  ignore  the  differences  between  them."  [Ref.  10:  p.  84]  In  object  oriented 
programming  languages,  like  Smalltalk,  all  computation  is  viewed  as  simulation,  the 
paradigm  supported  in  this  thesis,  and  programming  language  objects  correspond  to 
real  world  objects.  The  purpose  of  a  computer  program  is  normally  the  modeling  of 
some  aspect  of  the  real  world,  often  involving  the  changing  relationships  among  real 
world  objects.  Abstraction  for  the  simulation,  and  selection  of  relevant  subparts  of  the 
universe,  is  needed  in  order  to  design  and  code  a  computer  program.  "The  state  of  the 
simulation  is  represented  by  a  finite  number  of  objects  connected  by  a  finite  number  of 
relationships."  [Ref.  3:  p.  9)  When  a  program  executes,  objects  may  be  created  or 
destcyed.  and  the  relationships  among  them  may  change.  In  addition  to  the  real  world 
objects  we  have  the  "nonreal"  objects  that  can  be  examplified  by  the  what  if 
questions  m  spreadsheet  applications. 


B.  INTENSION  VERSUS  EXTENSION 

Because  the  computer  is  not  abie  to  represent  the  whole  universe  in  a  simulation 
it  is  necessary  to  distinguish  between  intension  and  extension.  In  an  object  oriented 
'language,  like  Smalltalk,  the  entire  state  of  the  simulation  is  therefore  represented  b>  a 
finite  number  of  objects  connected  by  a  finite  number  of  relationships.  "Two  relations 


have  the  same  intension  if  they  are  supposed  to  model  the  exact  same  externa! 
relationships  or  properties.  They  have  the  same  extension  if  they  apply  to  the  same 
object.'  tRef.  31:  p.  12]  and  extensions  may  vary  over  time.  Two  relations  may  be 
extensior.ally  the  same  even  though  they  are  intensionally  different.  The  intension  of  a 
computer  object  is  the  real  world  object  it  is  modeling;  the  extension  of  a  computer 
object  is  the  set  of  relations  to  which  it  belongs.  Objects  with  different  intensions  may 
coincident!}  nave  the  same  extension.  Therefore  two  or  more  computer  objects  that  are 
intended  to  model  distinct  real  world  objects  may  agree  in  all  the  modeied  properties 
and  relationships.  This  may  happen  because  a  computer  system  is  unable  to  represent 
ever,  property  and  relationship  in  the  universe.  The  designer  programmer  must  select 
i  unite  number  of  these  objects  and  relations  that  are  relevant  to  the  problem  he  she 
wants  to  solve.  The  programming  language  system  may  have  problems  distinguishing 
intensionally  distinct  relations  that  happen  to  have  same  extension. 


c.  INHERITANCE  in  general 

1  Inheritance  versus  Data  Abstraction 

Object  oriented  programming  encourages  modular  design  and  software  reuse. 
Data  abstraction  is  the  ability  to  define  new  types  of  objects  whose  behavior  is  defined 
abstractly.  Normally  an  object  oriented  language  supports  data  abstraction  by 
preventing  an  object  from  being  manipulated  by  other  means  than  via  its  defined 
external  operations.  The  fundamental  idea  of  inhentance  is  that  software  modules  may 
be  defined  as  extensions  (specializations)  of  previously  defined  software  modules.  The 
original  software  module  does  not  have  to  be  modified  when  it  is  used  as  a  basis  for  a 
new  extension.  In  object  oriented  programming  where  the  basic  software  modules  are 
based  upon  abstractions,  an  extension  of  a  software  module  would  then  correspond  to 
a  refinement  (specialization)  of  hierarchies  of  abstractions. 

Inheritance  compromises  encapsulation  (modularization)  in  many  object 
oriented  languages.  For  example  Smalltalk  lets  the  programmer  access  the  inherited 
instance  variables.  The  benefits  from  encapsulation  are  improved  understandabdity  ol 
programs  and  easier  program  modification.  To  be  able  to  do  debugging  and  creation 
of  programming  environments  most  programming  languages  provide  wju  to 
circumvent  encapsulation.  In  Smalltalk,  for  example,  the  operation  "insiVarAt:  and 


instVarAtiput '  allow  access  to  any  named  instance  variable  of  any  object  (Ref.  30:  p. 
2-Tj.  This  is  special  features  that  will  net  be  covered  more  in  detail.  Inheritance 
complicates  the  situation  by  introducing  a  new  category  of  client  (user)  for  a  class.  In 
addition  to  clients  that  simply  instantiate  objects  of  the  class  and  perform  operations 
on  them,  aiso  other  clients  (class  definitions)  inherit  from  the  class.  Venn  diagrams  will 
be  u<ed  as  a  descriptive  tool  in  this  section  in  order  to  clarify  the  inheritance  problem. 

'  Ref.  39:  p.  3SJ. 

Many  of  the  ideas  in  this  section  of  the  thesis  builds  on  work  done  by  Adele 
Goldberg  and  David  Robson. 

2.  Subclassing 

Single  inheritance  is  the  case  where  the  inheriting  class  (the  child),  directly 
inherits  from  a  single  class,  the  parent.  Hierarchical  classification  is  to  factor  out 
common  behavior  of  several  classes  of  objects.  Smalltalk  and  Simula  do  this  by 
permitting  classes  to  be  subclasses  of  other  classes.  In  our  definition  of  object  oriented 
programming  the  feature  inheritance  hierarchy  is  the  one  that  supports  the  simulation 
paradigm,  and  makes  the  object  oriented  languages  so  special.  Without  inheritance  the 
addition  of  a  new  type  of  object  requires  writing  entirely  new  procedures  for  common 
operations.  There  will  be  a  great  deal  of  similarity  between  these  different  methods,  but 
there  will  be  a  need  for  continous  rewriting  of  methods  that  differ  slightly  or  not  at  all. 
Inheritance  can  reduce  this  burden,  and  drastically  reduce  the  number  of  lines  of  code 
in  a  program.  Smalltalk-SO  s  class  hierarchy  builds  on  run  time  checking,  and  run  time 
binding  of  messages  to  methods. 

Dav-.d  Sandberg  [Ref.  40]  describes  an  alternative  to  subclassing  that  uses 
compile  time  typing,  adds  parameters  to  classes,  and  introduces  a  new  form  of  class 
called  a  descriptive  class.  This  alternative  supports  building  larger  modules  from 
smaller  modules,  while  Smalltalk-80  encourages  refining  the  behavior  of  an  existing 
class  by  creating  subclasses.  Like  subclassing,  the  descriptive  classes  allow  sharing  of 
vcde.  The  subject  of  descriptive  classes  wiii  not  be  pursued  further  in  this  thesis. 

The  class  structure  described  so  far  does  not  explicitly  provide  for  any 
intersection  in  class  membership,  see  Figure  6.1  for  a  visual  representation.  None  of 
the  classes  in  the  figure  overlap,  so  there  are  no  shared  objects  between  classes.  In  the 
Venn  diagrams  used  in  this  section,  the  circles  represent  the  classes  and  the  black  dots 
the  instances.  The  representation  of  an  instance  of  a  class  would  then  be  a  black  dot 
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Figure  6.1  Without  Intersection  in  Class  Membership. 


The  task  of  creating  a  specialization  of  an  existing  class  is  called  subclassing, 
and  the  existing  class  is  a  superclass  of  the  new  class.  The  classes  in  Smailtalk-SO  form 
a  tree;  more  than  one  class  may  share  the  same  superclass,  but  each  class  has  only  one 
immediate  superclass.  The  root  of  the  tree  is  the  class  "object,''  this  is  the  only  class 
without  a  defined  superclass.  In  general  the  instances  of  a  superclass  cannot  be 
affected  by  the  existence  of  one  or  more  subclasses.  Therefore  variable  names  added  to 
a  subclass  must  be  different  from  any  variable  declared  in  the  superclass.  The  subclass 
inherits  instance  variables,  class  variables,  and  methods  from  its  superclass.  The 
subclass  may  add  instance  variables  and  class  variables  to  make  the  subclass  more 
specialized  than  the  superclass  from  which  it  derives.  A  subclass  can  also  override  or 
pres  ide  additional  behavior  to  methods  of  a  superclass.  Methods  are  overridden  when 
a  new  method  for  an  old  method's  selector  ts  provided  [Ref.  35:  p.  1 42 j.  If  a  new 
method  makes  use  of  the  old  method  in  Smalltalk  a  message-send  to  the 
pseudcvariable  "'super'"  is  embedded  in  the  new  definition  of  a  method.  The  new 
behavior  may  precede,  follow,  or  surround  the  existing  behavior. 

If,  m  Smalltalk,  during  a  message-send  a  method  is  not  found  in  the 
immediate  class  of  the  instance,  the  superclass  method  dictionary  is  checked,  if  not 


Figure  6.2  Subclasses. 


found  there  either  the  superclass's  superclass  is  searched  and  so  on  until  the  "object" 
(the  root  of  the  tree)  is  searched.  If  even  the  class  "object"  does  not  have  a  matching 
selector  then  an  error  message  is  returned. 

Subclassing  is  to  allow  a  class  to  include  all  instances  of  another  class,  but  not 
to  allow  more  general  sharing  in  class  membership  (i.e.  not  multiple  inheritance),  see 
Figure  6.2  for  a  visual  representation.  A  subclass  specifies  that  its  instances  will  be  the 
same  as  instances  of  another  class,  its  superclass,  except  for  the  differences  that  are 
explicitly  stated.  Smalltalk-80's  subclassing  is  a  pure  hierarchical  system;  i.e.  if  there 
are  any  instances  of  a  class  that  also  are  instances  of  another  class,  then  absolutely  all 
the  instances  of  that  class  must  be  instances  of  the  other  class.  In  Smalltalk  the  classes 
themselves  are  considered  to  be  objects  belonging  to  a  meta  class.  A  meta  class  is  a 
c la i s  whose  instances  themselves  are  classes.  There  is  a  one-to-one  correspondence 
between  a  class  and  its  meta  class. 

The  use  of  classes  and  metaclasses  provides  a  mechanism  for  sharing 
information  between  different  objects  via  inheritance.  Inheritance  is  not  the  only 
scheme  lor  information  sharing,  and  the  requirement  that  each  object  permanently 
belongs  to  a  class  imposes  constraints  on  the  mutability  of  the  behavior  of  an  object. 
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An  example  of  another  mechanism  for  information  sharing  is  delegation"  [Ref.  41:  p. 
60],  Using  delegation,  subcomputations  can  be  passed  on  by  an  actor  to  another  actor 
which  continues  the  processing.  Delegation  provides  a  mechanism  for  code  sharing 
where  the  control  is  passed  to  an  independent  actor.  In  inheritance  mechanisms,  on 
the  ether  hand,  information  may  be  requested  from  a  more  general  class  to  which  an 
actor  belongs,  but  the  control  remains  localized.  Actors  are  computational  agents 
which  carry'  out  their  actions  in  response  to  incoming  messages.  Actors  encapsulate 
procedural  and  declarative  information  into  a  single  entity.  High  level  actor  languages 
use  inheritance  for  conceptual  organization  and  delegation  for  structuring  the  sharing 
of  code  between  different  actors. 

CommonObjects  [Ref.  39]  is  an  extension  of  Common  Lisp,  and  it  is 
representative  of  a  new  generation  of  object  oriented  programming  languages  that 
build  on  the  Smalltalk  and  Zetalisp  experience.  CommonObjects  provides  strong 
support  for  encapsulation,  m  particular  with  respect  to  inheritance.  Classes  are  not 
objects  in  CommonObjects,  just  as  types  are  not  objects  in  Common  Lisp.  The  access 
of  a  subtype  to  its  supertypes  is  restricted  to  the  same  abstract  interface  as  that 
presented  to  users.  Multiple  inheritance,  for  example,  from  two  classes  with  identical 
named  instance  variables  result  in  separate  copies  of  such  variables  when  an  abstract 
user  interface  is  used,  but  only  a  single  copy  if  a  non  abstract  user  interface  is  used 
Inheritance,  in  CommonObjects,  is  a  mechanism  for  defining  objects  whose  interlace 
include  all  the  operations  defined  for  another  class,  without  saying  anything  about  the 
internal  representation. 

3.  Inherited  Instance  Variable 

Generally  in  object  oriented  languages  the  code  of  a  class  can  directly  access 
ail  the  instance  variables  of  its  objects.  This  is  true  also  for  the  instance  variables 
defined  by  an  ancestor  class.  Permitting  access  to  instance  variables  defined  by  ancestor 
vlasses  compromises  the  encapsulation,  and  therefore  weakens  one  of  the  major 
benefits  of  object  oriented  programming.  This  problem  with  instance  variables  has 
resulted  in  many  different  implementations  [Ref.  39:  p.  40],  Flavors  [Ref.  2”j  does  not 
allow  merging  of  inherited  instance  variables  and  instance  variables  defined  locally  m  a 
..lass;  Smalltalk  signals  an  error  if  a  class  de tines  an  instance  variable  with  the  same 
name  as  ar.  inherited  instance  variable.  The  subclass  (in  Smalltalk'  inherits  both  the 
••  ,.r:able  declarations  and  methods  from  its  superclass,  but  gives  itself  a  new  class  name. 
1  he  ex.'tence  of  a  subclass  cannot  affect  the  superclass,  therefore  it  is  illegal  to  add  a 
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variable  name  to  a  subclass  that  is  declared  with  the  same  variable  name  m  its 
superclass.  Addition  of  shared  variables  will  make  them  accessible  to  the  instances  of 
the  subclasses  of  the  class.  This  also  means  that  a  subclass  has  the  same,  or  a  larger 
number  of  variables  than  the  superclass  (  i.e.  the  subclass  is  more  specialized). 

-1.  Programmer's  View  of  Inheritance 

1:  is  not  obvious  that  the  designer  of  a  programming  language,  or  a 
programming  enuronment,  thinks  of  inheritance  the  same  way  a  user  does.  For  the 
designer  programmer  the  purpose  of  inheritance  can  be  the  following  [Ref.  39:  p,  -llj: 

1.  A  private  decision  taken  by  the  designer  programmer  to  reuse  code  because  it  is 
useful  i saves  time)  to  do  so  for  him  her.  At  the  same  time  it  should  be  easy  to 
change  such  a  decision  later  on. 

2.  Making  a  public  declaration  that  objects  of  the  child  class  obey  the  semantics 
of  the  parent  class.  This  means  the  child  class  is  a  specialization  of  the  parent 
class.  Brachman  covers  this  in  depth  in  the  context  of  knowledge  representation 
[Ref.  42:  P.  S0-93>. 

For  the  programmers  in  object  oriented  programming,  a  single  object  may  look 
different  in  different  cases,  i.e.  multiple  views.  That  is  because  the  programmers  use 
different  parts  of  the  object,  and  manipulate  it  differently.  When  the  multiple  news  in 
addition  are  used  in  a  inheritance  hierarchy,  there  are  some  problems  with  how  the 
programmer  understand  his  programming  environment,  which  consists  of  a  large 
number  of  objects. 

In  an  integrated  interactive  programming  environment  it  is  important  that  the 
user  interface  is  consistent,  and  as  powerful  as  possible.  Multiple  inheritance,  without 
restrictions  caused  by  the  computer  system,  is  the  most  natural  to  use  for  the 
programmer.  On  the  other  hand  it  is  very  difficult  to  implement  multiple  inheritance  in 
an  integrated  software  system. 


D.  MULTIPLE  INHERITANCE 
1.  Overview 

The  most  general  way  to  achieve  multiple  inheritance  in  object  oriented 
languages  is  to  allow  arbitrary  intersection  of  class  boundaries,  like  in  Figure  6.3  which 
'hews  arbitrary  intersection  of  classes.  Multiple  inheritance  allows  a  situation  in  which 
some  objects  are  instances  cf  two  or  more  classes,  while  other  objects  are  instances  of 
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only  one  class  or  the  other.  Multiple  inheritance  is  a  very  powerful  technique  for 
reusability  of  code,  allowing  the  combination  of  more  than  one  previously  defined 
class  Multiple  inheritance  presents  problems  in  terms  of  what  is  to  happen  if  there  are 
multiple  paths  or  conflicting  instance  variables.  How  can  we  make  changes  to  the 
inheritance  hierarchy  safely?  If  the  use  of  inheritance  itself  is  globally  visible  (as  in 
most  implementations!  changes  to  the  inheritance  hierarchy  cannot  be  done  safely. 


Figure  6.3  Multiple  Inheritance. 


In  Smalltalk-80  the  implementation  description  can  be  modified  by  a  subclass 
as  follows  [Ref.  30:  p.  59): 

1.  The  class  name  must  be  overridden. 

2.  Variables  may  be  added. 

?.  Methods  may  be  added  or  overridden. 

To  override  a  method  means  that  if  a  subclass  adds  a  method  with  the  same  selector  as 
a  method  in  the  superclass,  the  subclass's  instances  will  respond  to  messages  with  that 
selector  bv  executing  the  new  method.  Standard  Smalltalk  does  not  support  general 
multiple  inheritance,  it  uses  a  tree  structure,  i.e.  each  class  has  only  one  superclass. 
I  he  pure  tree  structure  used  in  Smalltalk,  and  the  fact  that  there  are  no  hidden  side 
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effects  upon  other  objects  makes  the  language  easy  to  use.  In  the  tree  structure, 
essential  information  is  highlighted  on  one  level  and  the  details  are  specified  on  a  lower 
levei.  On  ail  the  layers  of  the  tree  but  the  lowest,  the  objects  are  fairly  complex.  Many 
of  the  ideas  in  this  section  is  from  Alan  Snyder's  work,  especially  from  his  paper: 

Fncapsulution  and  Inheritance  in  Object-Oriented  Programming  Languages" 
[Ref.  3*  p.  3 S — 15]. 

Multiple  inheritance  means  the  class  can  have  one  or  more  parents 
(superclasses).  A  class  can  be  viewed  as  forming  the  root  of  a  directed  acyclic 
inheritance  graph,  where  each  class  is  a  node  and  there  is  an  arc  from  each  class  to  its 
parent.  In  Smalltalk,  programming  language  objects  are  grouped  into  classes  (i.e., 
abstractions)  of  similarity  behaving  objects."  [Ref.  31:  p.  5]  An  example  of  multiple 
inheritance  is  shown  in  Figure  6.4  which  use  an  acyclic  graph  as  an  illustration  of  the 
problem. 


Figure  6.4  Fxample  of  Multiple  Inheritance  Acyclic  Graph. 

fhere  are  three  strategies  in  common  use  that  try  to  solve  the  problem  with 
multiple  inheritance  I  he  first  tries  to  deal  directly  with  the  acsJic  inheritance  graph, 
I  he  second  first  fattens  the  graph  into  a  linear  chain,  and  then  uses  the  rules  for  single 


inheritance.  The  third,  the  tree  solution,  avoids  the  problems  of  graph  oriented  and 
linear  solutions,  by  duplicating  nodes.  These  three  solutions  will  next  be  covered  in 
more  detail. 


2.  Graph  Oriented  Multiple  Inheritance  Solution 

I.xump'.es  cf  object  oriented  languages  that  model  the  inheritance  graph 
directly  are;  extended  Smalltalk  and  Trellis  Owl.  The  operations  in  these  two  languages 
are  inherited  along  the  inheritence  graph  until  redefined  in  a  class.  The  inheritance 
problem  arise  when  a  single  class  is  reachable  from  another  by  multiple  paths  as  in 
1  igure  o.4  where  the  graph  is  not  a  tree,  but  an  acyclic  graph.  [Ref.  34;  p.  42] 


I  :gure  6  5  lixample  of  Altered  Graph  Oriented  Multiple  Inheritance 


In  multiple  inheritance  one  class  can  have  more  than  one  parent  If  there  is 
an  one  parent,  and  the  class  inherits  operations  with  the  vine  identits  r.a.T.ei 
:re  than  one  parent,  there  is  an  identity  conflict.  This  identity  problem  van  be 
sec  f  igure  6  5,  by  redefining  o:  the  operations  m  the  subclass  ui.ildi. 
r.mges  a ; t h  the  graph  oriented  solution  are  that  lor  ar.v  ancestor  Gass  'here 
.ed  r.iy  one  vet  of  instance  saria'mes.  regardless  of  how  mans  p-t:>  ti.ere  are 
the  vlass  ;n  the  :nhcr:tu:.*e  graph  I  Ins  limits  the  programmer  s  freedom  tc  use 


1 


the  inheritance  within  a  class  without  the  possibility  of  destroying  some  descendant 
class.  Ihe  problem  is  most  serious  if  the  operation  is  invoked  on  the  same  set  of 
instance  variables  more  than  once,  and  if  the  operation  on  that  specific  class  has  side 
effects. 

3.  Linear  Chain  Multiple  Inheritance  Solution 

Flavors  and  CommonLoops  are  examples  of  languages  that  use  linear  solution 
to  solve  the  multiple  inheritance  problem  when  the  graph  is  not  a  tree,  but  an  acyclic 
graph.  These  two  programming  languages  first  flatten  the  acyclic  graph  to  a  linear 
'.ham,  without  duplicates."  Thereafter  the  result  is  treated  as  single  inheritance.  See 
Figure  6.6  for  a  visual  representation  of  the  solution.  Algorithms  in  the  language  create 
a  total  ordering  that  preserves  the  ordering  along  each  path  through  the  inheritance 
graph,  but  unrelated  classes  may  be  inserted  between  a  class  and  its  original  parent. 

I  he  computed  inheritance  chain  may  have  the  property  that  parent  of  a  class  B  may  be 
a  Jass  A  with  unknown  content  to  the  designer  of  class  A.  [Ref.  39:  p.  43] 


the  fact  that  unrelated  classes  may  he  inserted  between  a  class  and  its  original  parent. 
This  inserted  class  may  redefine  >ome  operation  so  that  the  communication  between 
the  child  and  the  original  parent  is  broken. 

d.  Tree  Conversion  Multiple  Inheritance  Solution 

Both  the  graph  oriented  and  the  linear  solutions  have  some  drawbacks,  as 
described  earlier.  The  tree  solution  avoids  these  problems  by  duplicating  nodes,  see 
figure  t\4  for  a  visual  representation  of  the  solution.  There  are  still  unresolved 
problems  with  multiple  inheritance  because  the  inheritance  is  not  close  enough  to  the 
user  s  view  of  multiple  inheritance. 

CommonObjects  is  an  example  of  object  oriented  languages  where  the 
semantics  models  the  inheritance  graph  'Ref.  ?d:  p.  44]: 

1.  Regardless  of  the  source  it  is  illegal  to  inherit  an  operation  from  more  than  one 
parent. 

2.  Each  parent  of  each  class  defines  a  completely  separate  set  of  inherited  instance 
variables.  The  acyclic  graph  is  converted  into  a  tree  by  duplicating  nodes,  and 
each  path  creates  a  separate  set  of  instance  variables. 

Situations  are  therefore  avoided  where  an  operation  can  accidently  be  invoked  multiple 
times  on  the  same  set  of  instance  variables,  or  where  two  classes  conflict  in  their  use  of 
uii  inherited  class.  In  CommonObjects  these  problems  are  corrected  b\  restricting  the 
.w-ess  to  the  inherited  instance  variables.  The  access  is  provided  m  the  firm,  of 
operations,  and  the  multiple  inheritance  supports  encapsulated  class  definitions. 


E.  SIMM  ARY  OF  THE  CHAPTER 

Inheritance  in  object  oriented  programming  languages  facilitates  reuse  of  mde 
b  ::w  are  modules  may  be  defined  as  extensions  <  specializations  >  of  previously  defined 
-  :: v. ire  modules. 

Single  inheritance  is  'he  case  where  the  inheriting  class  directly  inherits  ir  un  a 
vt.g.e  .lass,  the  parent.  In  our  definition  of  object  oriented  programming  the  icat  .re 
i.r.crit ar.ee  hierarchy  is  the  one  that  supports  the  simulation  paradigm,  and  that  makes 
t:.e  .■  yewt  oriented  languages  -c  special. 

o  dm! js is  :c  allow  a  class  to  include  all  instances  ol  another  class,  but  •  : 
v.  m  me  cer.eral  sharing  ;r.  .!../'  membership.  ;.e  not  mult. pie  mheritame  1  he 


more  general  way  to  achieve  multiple  inheritance  in  object  oriented  languages  ; 
ailow  arbitrary  intersection  of  class  boundaries.  There  are  three  ccmmcr.h  ; 
methods  to  solve  the  problem  of  implementing  multiple  inheritance:  graph  onei 
solution,  linear  chain  solution,  and  tree  conversion  solution. 


MI.  INTERACTIVE  PROGRAMMING  ENVIRONMENT 


WHAT  IS  AN  INTERACTIVE  PROGRAMMING  ENVIRONMENT? 
1  Definitions 


Maintenance  tools, 

a.  test  program  generators 


b.  tracers. 

c.  dumps  and  dump  interpreters, 
d  version  control. 

e  source  code  control. 

Performance  tools. 

a.  measurement  tools  (histogram  generators i. 

b.  analytic  performance  estimator. 

c.  benchmarking. 

d.  optimizers  ‘both  object  and  source  codei. 
Minor  tools. 

a.  copy  utilities. 

b.  directories. 

c  file  uderencer 


Impact  of  Tools 


Soft-cure  tools  are  becoming  more  and  more  important  in  develc 
tier,  areas  fer  computer  s\  stems  This  transformation  to  interactive 
r.mmg  environments  provides  the  user  with  conversational  access 
t.or.  means  that  the  different  tods  are  designed  to  fit  together,  and  a 
ether  >  features.  [  he  integration  increases  the  power  o!  'he  er.v 
:  the  -urn  'eta!  of  the  system  ;s  larger  than  the  sunt  of  the  single  to 
co  group  c;  users  often  view  the  ss stem  only  as  a  tod  that  m.n  help 
"  James  Burke  stated  it  erne  '  Ret  dd1 
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,.t  ’.uc  moment  man  first  picked  an  a  stone  or  a  branch  to  use  as  a  to 
c  adored  rrev ocabiv  the  miance  between  him  and  his  environment  I  : 
tis  nomt  :n.  the  uviv  in  which  the  wo  rid  around  hint  changed  a 
derent  It  was  no  longer  regular  m  predictable.  Mew  one.ts  j'ppe.u 
.it  were  n.t  recognizable  as' a  :r...tat;on  U  something  ’.nr.  had  <i\o 
etore  and  as  each  one  emerged  it  altered  the  environment  no*  mr  a  se.iy 
ut  forever  \\  rule  the  number  d  these  r-  ds  reni  ui.ed 
k  a  long  time  ti  sprc.sd  and  -.a -sc  chance  B~t  as  i.c  mme 
i.u  their  enacts,  the  n.  re  '  m.s. 


-sc  change  B.U  ac  U.c 
ucr  tit c  rate  cl  dt aitcc 


The  designer  s  programmer  s  environment  ml'.uer.ces.  mu  uter.  uen 
,u  d  rrcblems  he  she  can  and  w-.l  want  "c  solve,  how  tar  he  she  c  m 
1  here:  re  me  m  crammer  -an  vie-  ’c  more  time  and  enerc-  v  me 


mere  ambitious  and  more  productive.  The  avaiiable  programming  tools  are  a  critical 
part  of  the  environment,  and  much  work,  today  is  going  or.  to  implement  and  test  new 

teds 

'  What  is  so  Special  about  Programming  Environments 

Historically  programming  environments  have  been  used  to  help  the 
designer  programmer  in  the  development  and  maintenance  of  software.  In  the 
beginning  they  used  noncomputerized  tools  like  coding  forms,  pencils  etc.  Soon  this 
c'.  T\ed  into  computerized  tools  hke  assemblers,  compilers  cid  high  level  programming 
languages.  The  designers  and  programmers  built  tool6  to  help  themselves,  sy  mbolic 
debuggers  to  aid  m  the  debugging  process,  operation  systems  and  file  systems  to 
manage  the  computer  system,  text  editors  to  ease  entry  and  modification  of  'he 
program  text,  and  so  on. 

1  he  unicue  thing  about  an  interactive  integrated  programming  environment  :s 
•hat  .'.  is  used  to  represent  and  develop  other  programs,  and  to  manipulate  these  ether 
programs.  Many  of  the  abstract  types  are  entities  tn  the  program  the  programmer  is 
developing  m  the  interactive  programming  environment.  The  distinction  '•'etween  the 
program  that  w  manipulated,  and  the  programming  environment  ;n  which  it  w  Hung 
man. related  becomes  less  and  less  over  tune.  In  an  interactive  integrated  prog:  miming 
environment  the  programmer  may  interweave  activities  without  loosing  accumulated 
' rniat;.  tt.  and  without  giving  up  capabilities  I  -  picaih. .  :r.  programming 

.s  mpe/cr  rcv‘urccs  lire  j/irit  tf.c  ‘.vrk.  ;r«  procr  'o  vr* c 

I .  d av  we  have  a  varictv  of  programming  environments,  bat  there  exist  no 
uef.t.eu  - .  iss;;Hat:cn  of  them  Barstow  and  Shrebe  'Re!  u  p  no* »*  suggest  the 
■ .  c  ’ : .  t e  e  ..asses 

:  c- min.mc  er.v  ircr.nients  concerned  with  die  entire  lace*  He  of  a  rroeram  or 


i  grammmg  environments  cc: 
a  riose  *o  us  ,*re  relative.’,  indere 
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Prcgranum.ng  environments  which  reg  ire  coding,  debi.gg.ng.  testing 
maintenance  as  a  sir.cle  process  o:  procram  development  mr  u.ch  .iwreme: 
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integrated  programming  environments  like  Interlisp  and  Smalltalk.  These  two 
environments  are  relatively  easy  to  extend  to  take  care  of  the  whole  life  cycle  also  u.e. 


:tass  one). 


B.  IDENTITY  OF  OBJECTS 


Definition  of  Identity 


Identity  is  that  property  of  an  object  which  distinguishes  each  object  from  all 
others.  'Ret.  45:  p.  Two  dimensions,  at  least,  are  involved  in  identity,  the 

representation  dimension  and  the  temporal  dimension.  The  representation  dimension 
classifies  the  programming  languages  based  on  whether  they  represent  the  identity  of 
an  object  by  a  user  specified  name,  by  its  value,  or  if  it  is  built  into  the  programming 
language  itseif.'  [Ref.  45:  p.  414]  The  temporal  dimension  classifies  the  programming 
languages  ba«ed  on  whether  they  preserve  their  representation  of  identity  within  a 
single  program  or  transaction,  between  transactions,  or  between  structural 
reorganizations.  '  [Ref.  45:  p.  40“] 

2  Identity  in  Interactive  Programming  Environments 

Most  programming  languages  do  not  differentiate  between  addressability  and 
luer.titv.  md  me  the  v unable  names  as  the  only  was  to  distinguish  temporary  classes 


•p  >  an  entry  in  an  object  table,  and  we  say  that  the  identity  is  implemented 
•hr  ugh  a  level  of  indirection.  Indirection.  :e  indirect  phy veal  or  virtual  lddress 
ui.rlementat:  ms  would  allow  individual  classes  ,  objects,  to  be  moved  within  one 
:ddress  space.  I  his  would  provide  !u!i  data  independence,  but  net  allow  sharing  cl 
*"cs  •  objects'  among  multiple  programs  The  oops  are  the  most  common  form  o! 
,..ita  manipulated  by  the  Smalltalk-''"  interpreter 

In  object  oriented  sy stems,  like  Smalltalk,  the  instance  variables  determine  me 
...rrerf  s*ate  afar,  c’rject  .vhile  the  met:. cos  uof.ned  m  the  object  s  v.ass  determine  r.ovv 
me  -  em  behaves  It  .s  poss;'-Je  that  tw  r  or  more  objects  max  ^e  identica.  in  a.. 

...  ,  ;n.  s.  . .  .•  represent  mg  different  real  world  objects  I  he  identity 

ueperuem  •  '  the  .  ~  -  s'j'j.  md  merer  re  distinguishes  earn  object  ever,  in  the 


.  e  .mere  me  i:,r  "e 


r  nern. .inert.'.  ■  .dentie.il  in  a.i  properties  w;tr. 


The  identity  is  very  important  -vhen  building  interactive  integrated 
environments  using  object  oriented  programming  languages.  The  tools  are  using  each 
other,  therefore  distinct  identities  are  critical  to  prevent  identity  conflicts. 

3  What  Language  to  use  in  an  Interactive  Programming  Environment 

In  theory  all  programming  languages  are  equally  powerful.  To  actually  write 
the  code  m  different  languages  may  be  more  cr  less  easy  to  do  for  the  programmer,  but 
that  is  irrelevant  as  a  theoretical  measure  of  power. 

In  general,  programming  languages  are  more  suitable  for  certain  jobs  than 
others:  Basic  is  easy  to  learn  and  is  good  for  small  dialogue  oriented  applications, 
Fortran  is  well  suited  for  numerical  applications.  Cobol  is  tailored  to  business  data 
processing.  Pascal  is  designed  for  teaching  structured  programming,  Ada  is  ideal  for 
large  embedded  systems.  Lisp  is  very  good  for  processing  symbolic  information. 
Smalltalk  is  designed  for  simulation.  APL  for  manipulation  of  vectors  and  matrices.  C 
for  system  programming.  Simula  for  discrete  simulation,  etc. 

An  interactive  integrated  programming  environment  consists  of  a  set  of 
computerized  tools  that  is  designed  to  help  the  user  of  the  system.  The  programming 
environments  take  cognizance  not  only  of  the  technical  nature  of  the  software 
construction  process,  but  also  of  the  social  environment  in  which  it  is  actually  used. 
The  programming  task  takes  place  in  various  managerial  and  social  settings,  so  the 
computerized  tools  appropriate  in  one  context  and  may  be  inappropriate  in  another. 

Currently  it  seems  to  be  easier  to  implement  an  interactive  integrated 
programming  environment  in  an  interpreted  language  with  dynamic  binding  than  in  a 
traditional  compiled  language.  Lisp  and  Smalltalk  will  therefore  be  covered  in  more 
detail  in  relation  to  interactive  integrated  programming  environments.  In  Smai!taik-Sii 
the  incremental  program  execution  of  Lisp  is  combined  with  Simula's  class  subclass 
and  virtual  concepts.  Smalltalk  and  Lisp  have  a  lot  in  common:  a  fat  set  cf  definitions 
'Classes?,  dynamic  name  binding  and  run  time  type  checking.  Both  Lisp  and  Smalltalk 
have  a  main  loop  written  in  itself.  Smalltalk  or  Lisp  respectively.  The  loop:  read  \ 
command,  execute  the  command,  print  the  resuit,  and  loop.  And  both  languages 
support  exploratory  software  development. 
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4.  Incremental  Program  Development 

Complex  problems  are  often  difficult  to  specify  and  design.  We  often  know  we 
have  a  problem,  but  exactly  what  causes  the  problem  and  how  to  solve  it  may  well  be 
more  difficult  to  find.  A  software  program  will  normally  go  through  a  series  of  changes 
over  its  hie  cycle.  In  the  beginning  it  may  exist  as  a  loose  mental  description  of  what 
the  designer  programmer  wants  the  program  to  perform.  This  in  turn  may  (or  may  not) 
evolve  mto  a  more  formal  specification,  which  in  turn  may  evolve  into  a  design,  and 
finally  become  code  in  some  programming  language.  Maintenance  may  also  be  done 
m  the  same  fashion.  See  Figure  7,1  for  a  visual  presentation  of  this  incremental 
program  development. 


Figure  7.1  Incremental  Development. 


The  procram  may  evolve  as  a  series  of  experiments,  in  which  the  result  from 
one  steD  gives  the  input  to  the  design  of  the  next  step.  During  this  process,  the 
progr  ;rn  may  undergo  drastic  changes  as  the  problem  is  better  understood.  The  simple 
structure  the  programmer  starts  with  grows  by  increasing  the  complexity  of  the 
modules.  live  enhancement  process  continues  recursi.ely  until  a  finished  product  exists, 
live  growth  can  occur  both  'horizontally'  through  the  addition  of  more  facilities,  and 
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'vertically'  through  a  deepening  of  existing  facilities  and  making  them  more  powerful  in 
seme  sense."  [Ref.  46:  p.  63] 

In  an  ideal  interactive  integrated  programming  environment  all  of  this  process 
would  take  place  within  the  computer  system,  using  its  resources  (tools,  hardware,  etc.) 
to  help.  The  user  of  this  kind  of  system  seems  to  be  more  of  an  artist"  than  an 
operator.  He  she  wiiJ  have  the  ideas,  but  use  the  system's  resources  to  test  and 
implement  the  program  using  incremental  programming  development. 

MacLennan  [Ref.  43]  defines  system  development:  "The  entire  process  that 
takes  an  initial  idea  in  the  client  s  mind  to  a  final  constructed  system  that  satisfies  the 
client.  That  is,  the  entire  life  cycle'  of  the  system,  including  later  evolution  to  keep  the 
client  satisfied.''  This  definition  shows  how  dependent  the,  often  computer  illiterate, 
client  is  on  flawless  communication  with  the  designer  programmer  in  order  to  get  what 
he  she  wants. 


C.  HOW  TO  PUT  THE  USER  IN  CONTROL 

The  following  three  principles  can  help  the  average  user  of  a  computer  system  to 
reel  in  control  of  the  computer  resources: 

1.  Responsiveness,  that  is  that  the  user's  action  at  the  computer  should  have 
direct  visible  resuits. 

2.  Permissiveness  is  to  let  the  user,  not  the  system,  decide  what  to  do  next.  The 
system  should  appear  modeless  to  the  user. 

3.  Consistency  is  to  use  the  same  interface  for  the  whole  environment,  and  for  ail 
applications  in  the  environment. 

Some  producers  of  personal  computers,  like  Apple's  Macintosh  and  Commodor  s 
Amiga,  base  applied  these  principles  in  their  systems.  They  have  to  a  large  degree 
managed  to  put  the  user  in  control,  but  the  cost  has  been  much  more  complicated 
application  software.  The  burden  has  been  moved  from  the  user  to  the  hardware  and 
the  programmers  of  application  software. 


D.  LISP  IN  INTERACTIVE  PROGRAMMING  ENVIRONMENTS 


1  Why  use  Lisp 

The  basic  syntax  in  Lisp  is  very  simple,  and  the  programs  are  naturally 
represented  in  simple  Lisp  data  structures  in  a  way  that  reflects  the  structure  of  the 
program.  Lisp  represents  both  data  and  programs  with  lists,  therefore  it  is  simple  to 
write  Lisp  programs  that  read,  preprocess,  transform,  and  generate  other  Lisp 
programs.  Lisp  requires  no  declarations  and  therefore  programs  can  be  created 
incrementally,  this  would  normally  be  difficult  in  a  declarative  language.  The  dynamic 
type  system  and  flexible  data  structures  make  Lisp  well  suited  for  badly  specified 
problems,  and  very  well  suited  for  experimentation.  The  interpreter  in  Lisp  performs 
only  one  action,  applying  a  function  to  its  argument,  therefore  features  (tools)  like 
single  stepping,  tracing,  and  symbolic  debuggers  are  easy  to  implement.  In  addition 
the  simple  syntax  of  Lisp  makes  logical  presentation  of  code  on  a  screen  or  on  a  page 
natural  and  easy.  The  compiler  some  Lisp  implementations  have,  is  there  just  to  speed 
up  the  execution.  The  interpreter  giues  ail  the  different  tools  together  into  an  integrated 
system.  It  is  never  necessary  tor  the  designer  programmer  to  think  of  his  her  code  as 
anything  ether  than  the  source  code.  Such  a  view  is  in  principle  possible  also  in 
completely  compiled  programming  languages,  but  it  is  much  harder  to  achieve.  The 
ease  with  which  Lisp  programs  can  manipulate  other  Lisp  programs  has  given  us  a 
wide  variety  of  Lisp  programming  tools.  This  library  of  programming  tools  evolved 
into  a  programming  environment  that  supports  all  the  phases  of  programming:  design, 
coding,  debugging,  documentation,  and  maintenance.  Interlisp  is  one  example  of  the 
early  programming  environments  developed  around  Lisp  programming  tools. 

2.  The  Interlisp  Programming  Environment 
a.  Introduction  to  Interlisp 

Interlisp  is  an  interactive  integrated  programming  environment  based  cn 
Lisp.  It  is  in  extensive  use,  and  has  an  extensive  set  of  user  facilities;  including  syntax 
extension,  uniform  error  handling,  automatic  error  correction  (DWIM).  an  integrated 
structure  based  editor,  a  sophisticated  debugger,  a  compiler,  and  a  filing  system 
Ref.  -T:  p.  25-34],  The  system  is  used  at  many  sites,  mostly  at  education  centres 
•  universities),  and  it  is  well  documented  and  maintained.  The  Interlisp  environment 
has  evolved  ever  time  in  an  incremental  fashion.  Therefore  the  quality  of  the  user 
interface  has  been,  and  still  is.  iess  than  desired.  The  interfaces  are  inconsistent  and 


complex,  and  it  is  difficult  to  master  all  the  tools  and  facilities.  The  unique  thing  about 
Interlisp  is  the  following  two  attributes: 

1.  The  high  degree  of  integration. 

2.  How  easy  the  facilities  t. tools i  in  the  environment  can  be  tailored,  modified,  and 
extended. 

b.  Some  Facilities  in  [nterlisp 

The  following  section  will  discuss  some  of  the  important  facilities  in 
Interlisp.  Many  of  the  ideas  are  taken  from  Warren  and  Masinter:  "The  Interlisp 
Programming  Environment"  [Ref.  4~j. 

The  residental  system  is  defined  as  a  system  where  the  primary  copy  of  the 
program  resides  in  the  programming  system  as  a  data  structure.  The  user  makes 
changes  to  this  copy  during  the  interactive  session,  i.e.  editing  is  done  by  modification 
to  this  data  structure. 

The  file  package  is  defined  as  a  set  of  functions,  and  interfaces  to  other 
system  facilities  and  tools.  The  user  does  not  have  to  keep  track  of  where  things  are. 
and  which  things  have  changed.  In  modern  Interlisp  the  file  package  normally  operates 
automatically,  transparent  to  the  user.  The  user  no  longer  has  to  worry  about 
maintaining  his  source  files,  but  if  he  she  wants  to  make  changes  to  this  automatic 
bookkeeping  it  is  easy  to  redefine  or  change  these  operations.  The  general  file  package 
supports  the  abstraction  that  the  user  manipulates  his.  her  program  as  data  while  the 
file  is  just  one  of  the  possible  representations  of  the  code. 

Masterscope  is  an  interactive  program  for  analyzing  and  cross  referencing 
user  programs  in  order  to  predict  the  effect  of  a  proposed  change  to  the  program. 
Masterscope  "determines  which  functions  are  called,  how  and  where  variables  are 
bound,  set,  or  referenced,  which  functions  use  particular  record  declaration,  etc." 
;Ref.  d"’:  p.  30)  When  Masterscope  performs  its  analysis  it  builds  a  database  of  the 
result.  The  user  has  access  to  this  database,  and  can  interrogate  using  English  like 
queries.  In  addition  Masterscope  can  call  the  editor  on  all  functions  that  contain 
expressions  that  satisfy  certain  relationships  specified  by  the  user.  Masterscope  adds 
another  level  of  abstraction  to  the  system  because  the  user  no  longer  has  to  remember 
what  was  changed,  and  therefore  needs  new  analysis,  but  the  system  takes  care  of  this. 
The  interaction  between  different  functions  are  done  automatically,  and  transparent  to 
the  user. 


So 


Do  What  l  Mean  ,DWIM>  is  the  feature  that  facmates  me  the  most  I  he 
system  invokes  DWIM  when  it  detects  an  error,  then  the  DWIM  attempts  to  guess 
what  the  user  intended  to  do.  DWIM  is  a  collection  of  programs  that  makes  resonabie 
interpretations  when  giver,  unrecognized  inputs  at  user  level.  The  DWIM  svstem  is 
transparent  to  the  user,  and  is  an  important  part  of  the  user  interface.  The  simplest, 
and  most  visible,  part  of  DWIM  is  the  spelling  corrector  which  attempts  to  find  the 
closest  match  within  a  list  of  relevant  items.  This  list  is  easily  modified,  so  that  the  user 
can  tailor  it  to  his  her  needs.  The  spelling  corrector  can  be  used  to  enforce  standards 
etc.  i.e.  DWIM  automatically  transforms  input  to  a  standard  syntax. 

The  Programmer  s  Assistant  is  an  active  intermediary  between  the  user  and 
the  lower  levels  of  the  system.  "The  programmer's  assistant  records,  in  a  data  structure 
called  the  history  list,  the  user's  input,  a  description  of  the  side  effects  of  the  operation, 
and  the  result  ol  the  operation"  [Ref.  4":  p.  32).  The  "undo"  command  is  closely 
related  to  the  history  list.  As  long  as  the  user  doesn't  tell  the  system  to  do  otherwise, 
the  programmer's  assistant  will  always  be  part  of  the  user  interface.  In  most  cases  the 
programmer's  assistant  is  transparent  to  the  user,  and  responds  to  commands  that 
manipulate  the  history  list.  The  history  list  keeps  track  of  what  the  user  has  typed,  so 
that  keyboard  input  can  be  reused  while  just  specifying  what  has  changed.  Interlisp 
records  absolutely  every  change  to  the  structure,  but  it  does  not  record  why  it  was 
changed. 

c.  Interlisp-D  Programming  Environment 

Interlisp-D  is  in  general  Interlisp  with  windows  added  to  it.  and  it  is  an 
example  of  a  single  user  virtual  memory.  The  user  sees  the  interactive  integrated 
programming  environment  as  a  collection  of  windows.  Each  window  corresponds  to  a 
different  tool,  task  or  context.  The  introduction  of  bitmapped  displays,  pointing 
devices,  and  windows  has  greatly  enhanced  the  user  interface  of  the  Interlisp-D  system. 

One  of  the  major  disadvantages  with  standard  Interlisp  is  its  cumbersome 
user  interface,  and  Interlisp-D  solves  some  of  the  problems. 
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1  Why  use  Smalltalk 
a.  Introduction 

The  Software  Concepts  Group  at  Xerox  Paio  Alto  Research  Center  had  a-, 
the  coal  to  create  a  powerful  information  system,  one  m  which  the  user  car.  store, 
access  and  manipulate  information  so  that  the  system  can  grew  as  the  user's  ideas 
grow.  Both  the  number  and  kinds  of  system  components  should  grow  m  propenen  to 
the  growth  cl  the  users  awareness  of  hew  to  effectively  use  the  svstem" 
Ref.  ?0:  p.vil].  The  philosophy  is  to  choose  general  principles  and  apply  titem 
umfcrmiy  That  means  that  if  we  have  built  some  objects,  we  should  always 
use  enhance  these  existing  objects  when  possible  instead  of  creating  new  ones  from 
scratch.  The  Smalltalk  system  lets  the  user  make  changes  to  the  system  itself  while  it  is 
running,  :.e.  the  user  may  crash  the  system  by  modifying  some  objects  that  are  critical 
for  the  system. 

The  specification  of  Smalltalk-SO's  virtual  machine  describes  the  required 
'•'chavior  of  any  interpreter.  An  implementation  of  a  Smailtalk-80  interpreter  is  only 
required  to  exhibit  external  behavior  which  is  identical  to  that  described  by  the  formal 
specification  as  it  appears  in  "Smalltaik-SO:  The  Language  and  Its  Implementation"  by 
Goldberg  and  Robson.  As  long  as  the  external  behavior  is  preserved,  the 
implementation  can  make  design  tradeoffs  to  increase  the  performance,  or  meet  special 


The  choice  of  a  programming  language  to  implement  the  Smailtaik-SO 
interpreter  is  based  on  the  tradeoff  between  the  performance  needed,  and  the  ease  of 
implementation.  The  interpreter  depends  on  an  efficient  mapping  of  the  virtual 
machine  architecture  onto  the  available  hardware  resources  of  the  processor.  The 
resources  include:  registers,  preferred  memory  locations,  instruction  sequences,  etc. 
Generally  a  low  level  (assembly)  language  gives  the  implementor  total  freedom,  but 
he  she  must  also  take  total  responsibility  for  correct  programming.  This  is  in  contrast 
to  high  level  languages  where  the  designer  of  the  compiler  did  the  general  resource 
allocation  (which  often  is  not  optima!  for  the  Smalltalk  interpreter). 
b.  Features  in  Smalltalk 

The  user  interface  consists  of  many  facets  and  in  the  following  du*.  uss.cn 
seme  cf  the  mere  important  cues  are  studied  more  in  detail 


Views  are  the  rectangular  areas  on  the  display  screen.  Views  may  contain 
only  text,  only  pictures,  or  a  combination  of  the  two.  Views  are  the  same  a»  windows. 

:  e.  ;c  select  a  view  is  to  enter  a  window. 

The  browser  is  a  view  of  the  classes  in  the  Smalltalk.- )  s\  stem.  New  classes 
are  added  to  the  system,  and  existing  classes  are  examined  and  changed,  using  the 
browser.  Programmers  in  Smalltalk-SO  define  classes  and  methods  incrementally  by 
editing  m  s>  stem  browsers.  To  be  able  to  share  these  eiass  descriptions  with  others, 
files  are  used  for  communication.  The  files  are  called  "code  files"  and  allow  the  user  to 
communicate  source  code  between  one  Smalltalk,  system  and  another.  The  files  can 
also  be  used  to  communicate  information  from  the  system  to  itself  at  a  later  point  in 
lime.  The  file  that  stores  changes  does  so  by  appending  to  it.  and  therefore  previous 
versions  of  the  source  code  can  always  be  found  easily.  This  file  also  records  several 
other  kind  of  information  m  order  to  help  recover}’  after  a  system  crash:  it  marks 
execution  of  an  expression  in  a  code  generator,  occurence  of  a  snapshot,  etc.  The  file 
format  is  used  by  the  system  to  keep  the  source  code  for  the  methods  on  disk  files, 
rather  than  within  the  memory  of  a  resident  system. 

Crror  reporting  is  supplied  with  notifiers  and  debuggers.  The  process  in 
which  the  error  is  encountered  is  suspended  and  a  view  of  this  process  is  created.  The 
nctiliers  give  a  description  of  the  process  at  the  time  the  error  was  encountered.  The 
debugger  generally  gives  a  more  detailed  view,  but  also  allows  the  user  to  change  the 
state  of  the  suspended  process  before  resuming  it. 

These  interfaces  aiong  with  the  fact  that  Smalltalk  lets  the  user  change  the 
system  itself,  so  that  it  may  crash,  made  it  apparent  that  it  was  necessary  to  save  to 
disk  the  entire  state  of  the  system  at  certain  times.  This  is  called  a  "snapshot"  of  the 
system,  and  is  currently  performed  automatically  from  time  to  time.  In  addition  the 
user  can  invoke  the  snapshot  when  needed.  When  a  critical  error  occurs,  the  user 
"boots  and  resumes'  his  her  work  from  the  previous  state  saved  in  the  last  snapshot. 
In  Smalltalk-80  the  snapshot  is  represented  by  the  virtual  image  format.  The  "changes  ' 
hie  is  only  altered  by  appending  data  to  it,  therefore  any  previous  version  can  be 
found. 

c.  Smalltalk  as  a  Programming  Environment 

The  programming  process  is  assisted  by  several  classes  in  Smalltalk- s<‘». 
Different  classes  are  used  to  represent  the  user  readable  code  and  the  machine 
executable  form  of  methods.  Objects  arc  used  to  represent  parsers,  compilers,  and 
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er'.  Decompilers  translate  between  the  different  representations  < method 
representing  classes  connect  methods  with  the  objects  that  use  them. 
objects  representing  organizational  structures  for  classes  and  methods  help  t 
tier  keep  track  of  the  system  s  state,  and  objects  representing  histories 
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so::  -  are  changes  help  interface  with  other  programmers.  Finally  objects  also  represent 
the  eveo..’:cn  state  of  a  method;  they  are  called  contexts,  and  are  similar  to  stack 
frames  or  activation  records  of  other  programming  systems.  Everything  in  Smalltalk  is 
an  object,  and  any  object  can  be  bound  to  any  name  because  no  names  are  t>ped. 
Smalltalk  has  dynamic  type  checking,  like  Lisp,  whiie  for  example  Pascal  and  Ada  ha\e 
static  type  checking.  Dynamic  type  checking  lets  Smalltalk  allow  a  message  to  be  sent 
;o  an  object  only  if  that  object  has  a  method  to  respond  to  the  message.  Any  object 
with  the  proper  protocol  may  be  passed  to  a  method.  In  Smalltalk  it  is  not  possible  to 


crash  the  system  due  to  type  violation. 

The  uniformity  of  Smalltalk  is  only  valid  within  the  system  itself.  It  is  not 
possible  to  maintain  this  uniformity  in  the  interfaces  to  the  external  world,  because  the 
external  world  consists  of  disk  files,  printers,  etc.  that  are  not  Smalltalk  objects.  All 
programs  that  want  to  share  information  with  some  other  program  meet  this  problem. 

Smalltalk  is  a  graphically  oriented  interactive  integrated  programming 
environment.  The  language  is  designed  so  that  all  components  in  the  system  that  are 
accessible  to  the  user  can  be  presented  in  a  meaningful  way  for  manipulation  and 
observation.  Smalltalk  builds  on  the  model  of  independent  communicating  objects. 
Applications  written  in  the  language  are  viewed  in  the  same  way  as  the  fundamental 
units  from  which  the  system  itself  is  built.  Interaction  between  the  most  primitive 
objects  is  viewed  in  the  same  way  as  high  level  interaction  between  the  computer  and 
the  user.  The  pure  object  oriented  programming  languages  are  ideally  suited  for 
interactive  integrated  programming  environments.  A  person  working  at  a  terminal 
responds  to  conditions  and  takes  actions  in  time. 

Most  traditional  systems  are  built  around  a  kernel  of  code  which  cannot 
easily  be  modified.  In  the  Smalltalk  systems  the  kernel  consists  of  machine  code  and 
mu.rocc.de  in  order  to  implement  a  virtual  machine.  The  kernel  must  be  as  small  as 
possible  to  prevent  the  need  for  frequent  changes  to  it.  Smalltalk  facilitates 
increment  .si  design  of  the  system,  therefore  we  have  the  problem  of  ensuring  the 
mteari'.y  of  the  system.  1  he  system  tracer  takes  care  of  this  problem.  It  ;s  a  program 
running  inside  of  Smalltalk  that  copies  the  whole  system  cut  to  a  file  while  it  is 
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The  svstem  tracer  lets  the  user  li\ e  in  the  svstem  he  she 
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Without  having  to  start  from  scratch  even,'  time  the  system  crashes.  One  of  the  benefits 
from  the  system  tracer  is  that  it  makes  it  easier  to  use  a  fully  interactive  integrated 
programming  environment  for  production  applications.  The  total  system  contains 
many  tools  facilities  that  are  not  needed  in  production  systems,  i.e.  compiler,  debugger, 
editor,  communication,  etc.  The  system  tracer  has  the  ability  to  strip  oil'  most  of  these 
unnecessary  tools  facilities.  In  addition  the  system  tracer  can  be  used  to  produce 
mutations '  of  existing  programs.  An  example  of  this  is  hove  we  can  change  the 
floating  point  number.  We  can  include  an  appropriate  transformation  in  the  system 
tracer  and  write  out  a  mutation  of  the  old  program,  then  we  can  replace  the 


routine  instruction  m  the  virtual  machine  and  start  up  again  with  a  modified  system. 
This  modification  method  has  been  used  in  Smalltalk  to  changei  floating  point 
numbers,  instruction  set  of  the  virtual  machine,  format  of  compiled  methods,  and 


encoding  of  small  integers  [Ref.  23:  p.  26J. 

Some  computer  scientists  feel  that  object  oriented  programming  languages 
can  take  over  some  of  the  roles  from  the  operating  systems.  Daniel  Ingalls  stated  it 
even  stronger:  An  operating  system  is  a  collection  cf  things  that  don't  fit  into  a 
language.  There  shouldn't  be  one.''  [Ref.  4S] 


F.  SUMMARY  OF  CHAPTER 

The  unique  thing  about  interactive  integrated  programming  environments  are 
that  they  are  used  to  represent  and  develop  other  programs,  and  to  manipulate  these 
other  programs.  The  interactive  integrated  programming  environment  consists  of  a  set 
of  tools,  and  a  tool  is  anything  that  can  help  in  the  programming  process.  The 
integration  of  tools  are  very  hard,  and  today  it  is  easier  to  do  in  an  interpreted 
programming  language  with  dynamic  binding,  like  Lisp  or  Smalltalk  than  in  a  compiled 
programming  language.  In  addition  Lisp  and  Smalltalk  facilitate  incremental  program 
development,  i.e.  the  program  evolves  as  a  senes  of  experiments  in  which  the  result 
from  one  step  gives  the  input  to  the  design  of  the  next  step. 

The  user  interface  may  be  enhanced  if  the  following  principles  are  followed: 
re'por.Mvenc''.  permissiveness,  and 
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VIII.  CONCLUSIONS  AND  RECOMMENDATIONS 


A.  CONCLUSIONS 

The  structure  of  our  language,  also  programming  languages  and  programming 
environments,  define  the  boundaries  of  human  thought.  Therefore,  e'.en  if  the 
theoretical  power  of  all  languages  are  the  same,  programming  languages  and 
programming  environments,  at  the  moment,  limit  human  creativity  and  ability  to  soive. 
new.  large,  and  complex  problems.  Today  s  programming  languages  and  programming 
environments  are  specialized,  and  net  well  suited  for  all  types  of  problems. 

Increased  emphasis  is  put  on  the  user  interface.  We  have  a  lot  of  background 
knowledge  in  this  area,  but  at  the  moment  we  don't  know  exactly  how  to  produce  the 
optimum  user  friendly  interface.  Bitmapped  displays,  menus  and  mouse-like  devices 
have  so  far  been  the  solution.  The  burden  is  moved  from  the  user  to  the  hardware  and 
programmers  of  application  software.  The  unique  thing  about  interactive  integrated 
programming  environments  are  that  they  are  used  to  represent  and  develop  other 
programs,  and  to  manipulate  these  other  programs. 

In  general  a  homogenous  representation  of  a  programming  language  makes  it 
easier  to  create  an  interactive  integrated  programming  environment.  Smalltalk  and 
Lisp  are  examples  of  this.  The  languages  are  based  on  a  relative  small  number  of 
consistent  abstractions,  and  seek  to  provide  uniform  treatment  of  different  kinds  of 
information:  text,  graphics,  symbols,  and  numbers. 

Object  oriented  programming  languages  do  not  give  technical  advantages,  but 
cross  i  threshold  cf  perception  and  make  it  easier  for  the  human  being  to  solve  new 
and  complex  problems.  To  be  a  true  object  oriented  programming  language  four 
features  must  be  supported:  information  hiding,  data  abstraction,  dynamic  binding,  and 
inheritance  hierarchy.  A  result  of'  these  features  is  that  this  thesis  supports  the 


paradigm  as  the  most  appropriate  for  object  oriented 
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Smalltalk- V  is  an  example  of  this  kind  of  language,  but  in  addition  Smaiitaik-X"  ;s  an 
i.teravt.ve  integrated  rrccrammmc  environment  m  itself 


Srr.alltalk-SO  :s  both  a  programming  language  and  an  interactive  integrated 
programming  environment.  The  Smalltalk-SO  system  has  the  feature  that  it  can  mod::;, 
itself  and  therebv  produce  a  ne'.v  interactive  integrated  programming  environment 
adapted  to  the  user  s  needs.  It  can  simulate  the  new  environment  in  the  existing  i c id > 
environment,  then  use  the  simulation  to  actually  produce  the  new  system. 


B.  RECOMMENDATIONS 


i.  What  Can  be  Done  Now 

In  order  to  make  the  user  feel  more  comfortable  with  his  her  programming 
environment,  new  applications  should  build  on  skills  the  user  already  has  instead  of 
forcing  him  her  to  learn  new  skills.  The  user  should  stay  in  control  of  the  computer 
throughout  the  session.  Emphasis  should  be  put  on  responsiveness,  permissiveness  and 
consistency  when  the  user  interface  is  designed.  A  good  naming  convention  can  help 
the  user  to  easily  get  the  purpose  of  the  functions,  routines,  objects,  etc. 

Interactive  integrated  programming  environments  based  on  interpreted 
programming  languages  like  Lisp  and  Smalltalk  seem  to  be  best  suited  for 
environments  producing  new  programs,  and  not  so  much  for  pure  execution  cf 
computation  heavy  applications. 


2.  Future  Research  Areas 

The  interactive  integrated  programming  environment  of  the  future  should  pay 
more  attention  to  the  total  project  development  problem.  Therefore  it  should  ..it  least 
be  developed  m  the  foiiowmg  directions: 

1.  The  design  and  programming  tools  should  be  even  more  integrated. 

2.  A  good  database  system  to  keep  track  of  versions  etc.  is  needed. 

The  project  management  tools  must  be  integrated  with  the  rest  of  the  svstem. 
including  the  database. 

4.  Tools  to  perform  semantic  aruhsis  during  programming  is  needed 

'  Reduce  or  el. ruinate  the  semantic  differences  between  di.Terent 

languages  environments.  Consistent  use  of  commands  etc. 

Beth  Lisp  and  Smalltalk  let  the  user  modify  his  her  cwn  environment. 
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well  defined  principles  for  the  user  interlace  in 


•rear  *o  reduce  the  burden  on  the  user  even  more.  Detailed  specifications 
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APPENDIX  A 

SMALLTALK-80  TERMINOLOGY 


This  appendix  contains 
thests.  The  definitions  are 
Implementation'  by  Adele  Gel 

•  ABSTRACT-CLASS 

•  CLASS 

•  INSTANCE 

•  INSTANCE  VARIABLE 

•  INTERFACES 

•  MESSAGE 

•  MESSAGE  SELECTOR 

•  ML  T  ACL  ASS 

•  METHOD 


a  summary  of  Smaiita!k-SO  s  terminology  used  m  this 
taken  from:  Smalitalk-SO:  The  Language  and  its 
j'oerc  ana  David  Robson  [Ref.  ?0j. 


A  class  that  specifies  protocol,  but  is  not  lIpIv  to 
fully  implement  it,  by  convention,  instances  are 
not  created  of  this  kind  of  class. 


An  object  that  describes  the  implementation  of  a 
set  of  similar  objects. 

One  of  the  objects  described  by  a  class.  ;;  has 
memory  and  responds  to  messages. 


A  variable  available  to  a  single  object  for  the 
entire  lifetime  of  the  object;  instance  variables  van 
be  named  or  indexed. 

the  set  of  messages  to  which  an  object  .  in 
respond.  The  only  way  to  interact  with  an  object  > 
through  its  interface. 


A  request  for  an  object  to  carry  out  one  of  its 
operations. 

The  name  of  the  type  of  operation  a  message 
requests  of  its  receiver. 

I  he  class  of  a  class 

A  procedure  describing  hew  to  perform  one  of  an 
object  s  operations;  it  is  made  up  of  a  message 
pattern,  temporary  variable  declaration,  md  a 
sequence  of  expressions.  A  method  ;s  exe. uted 
when  a  message  matching  its  message  p.:"orn  ;s 
sent  to  an  instance  of  the  class  m  v.;d;  the 
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•  OBJECT 


A  component  of  the  Smalltalk-So  system 
represented  by  seme  private  memory  and  a  set  of 
operations. 


•  OVERRIDING 


Specifying  a  method  in  a  subclass  for  the  same 
message  as  a  method  in  a  superclass. 


•  PRIMITIVE 


An  operation  performed  directly  by  the 
smalIta!k-SO  virtual  machine. 


•  RECEIVER 


The  object  to  which  a  message  is  sent. 


•  SUBCLASS 


A  class  that  inherits  variables  and  methods 
an  existing  class. 


•  SUPERCLASS 


The  class  from  which  variables  and  met 
inherited. 


•  SYSTEM 


The  set  of  classes  that  come  with  the  S: 
svstern. 
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APPENDIX  B 

TOWER-OF-HANOI  IN  PROLOG 


Tower-Of-Hanoi  program  in  Advanced  A.I.  Systems'  Prolog,  Version  M-1.13.  In 
order  to  run  the  program  you  must  specify  the  number  of  disks,  and  the  names  of  the 
three  poles  used. 


hano i < 0 , From ,To , Other )  !. 

hanoi  <Dtsk, From, To, Other) 

D i sk 1  i s  0 1 sk-1  , 

hanoi<Di ski, From, Other, To), 

moveCDi sk , From  , To) , 

hanoi <Di ski .Other, To, From). 

moveCDi sk .From, To) 

n 1 ,wr i te < 'Move  disk  ').write(K), 
writet"  -from  pole  ' )  ,wr  i  te(From)  , 
write<-'  to  pole  ' )  ,wr  i  te(To)  . 

hanoi <3,polel ,pole2,pole3) . 


V.  nothing  to  do . 

V.  move  top  Disk-1  aside 
V.  move  Disk 

V.  move  top  Disk-1  back 


APPENDIX  C 

TOWER-OF-HANOI  IN  LISP 


The  Tower-Of-Hanoi  program  builds  on  a 
and  Horn  [Ref.  20:  p.  88-90], 


program  found  in  "LISP"  by  Winston 


(defun  Hanoi  nil  (Transfer  ‘1  ‘3  ‘2  (read))) 
(defun  MoveOisk  (From  To) 

(print  (list  From  ->  To)) 

(terpri)) 

(defun  Transfer  (From  To  Using  Height) 
(cond ((equal  Height  I) 

(MoveOisk  From  To)) 

(t  (Transfer  from 
Using 
To 

(sub  1  Height)) 

(MoveOisk  From  To) 

(Transfer  Using 
To 

From 

(sub  I  Height))))) 


,K  disks  on  I  first. 

.Print  instruction. 
.Start  new  line 


.Transfer  one  disk 
.Move  from  From 
,to  Using 

.using  To  as  space 
;(K  -  l )  disks. 

.Move  lowest  disk 
.Move  from  Using 
;  to  To 

.using  From  as  space 
,(K  -  i )  disks 


APPENDIX  D 

TOWER-OF-HANOl  IN  PASCAL 


The  Tower-Of-Hanoi  program  is  based  on  a  program  in  "A  Taste  of  Smalltalk* 
by  Ted  Kaehler  and  Dave  Patterson  [Ref.  49:  p.  3-4],  MacPascal  version  1.0  is  used. 


program  To wersOf Hanoi; 

var 

howMany  :  integer;  (Number  of  disks) 
procedure  MoveTower  (Height,  FromPole,  ToPole,  UsingPole  :  integer); 
procedure  MoveDisk  (FromPole,  ToPole  :  integer), 

begin 

writelnCFromPole*,  FromPole  :  3, '  ToPole*,  ToPole  .  3); 

end, 

begin  (MoveTower) 
if  height  >  0  then 
begin 

MoveTower(Height  -  1,  FromPole,  UsingPole,  ToPole); 
MoveDisk(FromPole,  ToPole); 

MoveTower(Height  *  1,  UsingPole,  ToPole,  FromPole); 

end, 

end,(MoveTower) 
begin  (TowersOf Hanoi) 
showtext, 

writelnCHow  many  disks  do  you  want?’); 
readln(howMany); 

MoveTowerthowMany,  I,  3,  2); 
end  (T  o  wersOf  Hanoi ) 
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APPENDIX  E 

TOWER-OF-HANOI  IN  SMALLTALK-80 

The  Tower-Of-Hanoi  program  is  taken  from  "A  Taste  of  Smalltalk”  by  Ted 
Kachlcr  and  Dave  Patterson  I  Ref.  49). 


*  METHOD  MoveDiskto: 


moveDisk:  fromPoJe  to.  toPole 
'Move  Disk  from  o  pole  to  Another  pole. 
Print  results  in  the  transcript  window' 
Transcript  cr. 

Tronscript  show:  (fromPole 
printstring,  toPole  printstring). 


*  METHOD  moveTower:  from:  to:  using. 


moveTower:  height  from:  fromPole  to:  toPole  using:  usingPole 
'Recursive  procedure  to  move  the  disk  at  a  height  from  one  pole  to  another 
using  a  third  pin' 

(height  >  0)  ifTrue.  I 
self  moveTower:  (height  -  1)  from: 

fromPole  to:  usingPole  using:  toPole. 
self  moveDisk.  fromPole  to:  toPole. 
self  moveTower:  (height  -  1)  from: 
usingPole  to:  toPole  using.  fromPole) 

'Run  the  program  by  selecting  and  choosing  do  it' . 

(Object  new)  moveTower:  3  from:  1  to.  3  using.  2* 
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